WO 01/64835



PCT/US01/04927



SEQ ID
NO:


Accession No.


Species


Description


Smith-
Waterman
Score


%
Identity


13563 


AF118082


Homo sapiens


PRO1902


83 


86 


13564 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO:
5327.


234


97






13565 


Y59778 


Homo sapiens 


Human normal ovarian tissue derived
protein 55.


66


100






13566 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO:
5909.


103


60






13567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68,
SEQ ID NO: 163.


152


96






13568 


XS3703 


Homo sapiens 


nuclear protein 


1633 


99 


13569 


G02473 


Homo sapiens 


Human secreted protein, SEQ ID NO:
6554.


242


45






13570 


G00521 


Homo sapiens 


Human secreted protein, SEQ ID NO:
4602.


105


90






13571 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein
RHA1a (AF078683)


110


80






13572 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein
RHA1a (AF078683)


114


46






13573 


X92744 


Homo sapiens 


hBD-1 


290 


83 


13574 


P70494 


Homo sapiens 


Sequence of human B-cell growth
factor (BCGF).


107


68






13575 


AF107406


Homo sapiens 


GW128 


356 


100 


13576 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal
sequence.


101


48






13577 


AF220264 


Homo sapiens 


MOST-1 


95 


73 


13578 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68,
SEQ ID NO: 163.


129


68






13579 


AK024455 


Homo sapiens 


FLJ00047 protein 


81 


76 


13580 


AF130051 


Homo sapiens 


PRO0898 


128 


64 


13581 


AF084256 


Homo sapiens 


beta glucuronidase isoform d 


142 


58 


13582 


AF118082


Homo sapiens


PRO1902


85 


51 


13583 


Y02886 


Homo sapiens 


Fragment of human secreted protein
encoded by gene 90.


114


52






13584 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO:
6536.


66


54






13585 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


109 


64 


13586 


AL359782 


Trypanosoma
brucei


possible (hhv-6) ul 102, variant a dna,
complete virion genome.


109


77






13587 


AF090901 


Homo sapiens 


PRO0195 


114 


53 


13588


G00397


Homo sapiens


Human secreted protein, SEQ ID NO:
4478.


95


66






13589 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO:
7437.


117


58






13590 


AK000521 


Homo sapiens 


unnamed protein product 


1313 


100 


13591 


AL357374 


Homo sapiens 


bA353C18.2 (novel protein) 


421 


100 


13592 


AL357374 


Homo sapiens 


bA353C18.2 (novel protein) 


421 


100 


13593 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO:
7867.


139


62






13594 


Y02749 


Homo sapiens 


Human secreted protein encoded by
gene 100 clone HNFIU96.


103


56






13595 


U16359 


Rattus
norvegicus


nitric oxide synthase


109


76








13596 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein
RHA1a (AF078683)


127


88






13597 


G00325 


Homo sapiens 


Human secreted protein, SEQ ID NO:
4406.


275


100






13598 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal
sequence.


108


63






13599 


AF116661


Homo sapiens


PRO1438


128 


64 





13600 


X55682 


Lycopersicon 

esculentum 


extensin (class 1) 


58 


43 


13601 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


45 


13602 


U28971 


Caenorhabditis 

elegans 


similar to RD tandem repeat region of 
RD protein (nuclear RNA-binding
protein) 


122 


92 


13603 


AF149419 


Oryctolagus 
cuniculus 


eye sodium bicarbonate cotransport 
protein NBC2 


126 


75 


13604 


AJ005567 


Mus musculus 


SPR2I protein 


63 


40 


13605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


150 


96 


13606 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


121 


60 


13607 


G02867 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6948. 


85 


60 


13608 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


57 


35 


13609 


AF107406


Homo sapiens 


GW128 


108 


42 


13610 


D82345 


Homo sapiens 


NB thymosin beta 


167 


100 


13611 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


59 


13612 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


50 


13613 


Y32193 


Homo sapiens 


Human receptor molecule (REC) 
encoded by Incyte clone 044150. 


490 


80 


13614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


57 


13615 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3.


124 


58 


13616 


AF119900


Homo sapiens


PRO2822


95 


53 


13617 


W80406 


Homo sapiens 


A secreted protein encoded by clone 

dh40_3. 


124 


58 


13618 


S79978 


Homo sapiens 


prion protein, PrP {octapeptide 

repeats } 


88 


42 


13619 


AF220264 


Homo sapiens 


MOST-1 


130 


81 


13620 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


65 


13621 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


168 


85 


13622 


K01664


Drosophila 
melanogaster 


Bkm-like protein 


118 


65 


13623 


M86246 


Homo sapiens 


EHS-2 


100 


63 


13624 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


104 


46 


13625 


AF067205 


Homo sapiens 


vesicle transport related protein 


198 


69 


13626 


U39529 


Echinometra 
mathaei 


bindin 


72 


47 


13627 


S79410 


Mus musculus 


nuclear localization signal binding 

protein 


102 


52 


13628 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


123 


54 


13629 


K01664 


Drosophila 

melanogaster 


Bkm-like protein 


113 


46 


13630 


AF161536 


Homo sapiens 


HSPC051 


582 


100 


13631 


Y27854 


Homo sapiens 


Human secreted protein encoded by 

gene No. 10 L 


158 


87 


13632 


AF130089 


Homo sapiens 


PRO2550 


105 


66 


13633 


AF116661


Homo sapiens


PRO1438


116 


53 


13634 


AF116715


Homo sapiens


PRO2829


101 


60 





13635 


Y14482


Homo sapiens 


Fragment of human secreted protein 

encoded by gene 17. 


139 


63 


13636 


U33547 


Homo sapiens 


MHC class II antigen 


154 


79 


13637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


107 


75 


13638 


AF090895 


Homo sapiens 


PRO0117 


87 


62 


13639 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO:
6613.


131 


69 


13640 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


128 


60 


13641 


S58722 


Homo sapiens 


X-linked retinopathy protein {C-
terminal, clone XEH.Sc} 


128 


81 


13642 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


210 


67 


13643 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


287 


76 


13644 


Y73483 


Homo sapiens 


Human secreted protein clone yI18_l 
protein sequence SEQ ID NO: 188. 


376 


88 


13645 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


130 


85 


13646 


Y87212 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 251.


423 


97 


13647 


U50403 


Homo sapiens 


breast cancer suppressor element 
Ishmael Upper RP2 


103 


86 


13648 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO:
7881.


128 


55 


13649 


X58521 


Homo sapiens 


nucleoporin p62 


2610 


98 


13650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


69 


13651 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


111 


81 


13652 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


140 


60 


13653 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


121 


46 


13654 


U13066 


Nicotiana alata 


arabinogalactan-protein precursor 


92 


33 


13655 


S79410 


Mus musculus 


nuclear localization signal binding 

protein 


134 


50 


13656 


Y07766 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 23. 


156 


100 


13657 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


124 


48 


13658 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


171 


80 


13659 


AK000017 


Homo sapiens 


unnamed protein product 


611 


100 


13660 


U90446 


Mus musculus 


RNAse L inhibitor 


3100 


99 


13661 


AE000882 


Methanothermobacter
thermoautotrophicus


phosphoenolpyruvate synthase 


88 


38 


13662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


65 


13663 


K01664


Drosophila 
melanogaster 


Bkm-like protein 


97 


34 


13664 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


77 


41 


13665 


Y94890 


Homo sapiens 


Human protein clone HP02798.


325 


98 


13666 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO:
7795. 


139 


65 


13667 


AF090930 


Homo sapiens 


PRO0478 


119 


88 


13668 


AF130089 


Homo sapiens 


PRO2550 


120 


77 





13669 


AC006693 


Caenorhabditis 

elegans 


Hypothetical protein W02H5.e 


165 


85 


13670 


W80293 


Homo sapiens 


Human translocation associated protein 
designated Gp25L-H. 


1003 


95 


13671 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


113 


91 


13672 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


160 


60 


13673 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


178 


75 


13674 


AF118086


Homo sapiens


PRO1992


85 


75 


13675 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


132 


55 


13676 


AF220264 


Homo sapiens 


MOST-1 


133 


85 


13677 


Y19730 


Homo sapiens 


SEQ ID NO 448 from WO9922243.


122 


51 


13678 


Y65416 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1577. 


467 


98 


13679 


Y16589 


Homo sapiens 


A protein that interacts with 
presenilins. 


2286 


99 


13680 


AL160371


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


104 


52 


13681


AF090931 


Homo sapiens 


PRO0483 


117 


67 


13682 


X55686 


Lycopersicon 

esculentum 


extensin (class II) 


60 


56 


13683 


AF090944 


Homo sapiens 


PRO0663 


93 


90 


13684 


AL096770 


Homo sapiens 


bA150A6.2 (novel 7 transmembrane 
receptor (rhodopsin family) (olfactory 
receptor like) protein (hs6Ml-21)) 


178 


52 


13685 


AF116661


Homo sapiens


PRO1438


115 


60 


13686 


U62039 


Elephantulus 
edwardii 


reverse transcriptase 


86 


53 


13687 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


77 


36 


13688 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867.


114 


60 


13689 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


53 


13690 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO:
7795. 


107 


51 


13691 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


178 


85 


13692 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163.


94 


35 


13693 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


143 


78 


13694 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


98 


50 


13695 


D63163 


Rattus sp. 


cyclin C 


111 


86 


13696 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18.


118 


95 


13697 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA1a (AF078683)


124 


80 


13698 


Y14482


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


70 


13699 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


93 


58 


13700 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


100 


73 


13701 


M76744 


Homo sapiens 


BGP 


112 


52 





13702 


Y15155 


Homo sapiens 


phosphorylase kinase beta-subunit 


206 


100 


13703 


AF090901 


Homo sapiens 


PRO0195 


100 


70 


13704 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


131 


79 


13705 


R95913 


Homo sapiens 


Neural thread protein.


106 


36 


13706 


M15073 


Homo sapiens 


MHC HLA-DR-beta-1 chain 


74 


77 


13707 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


110 


35 


13708 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


150 


56 


13709 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


89 


77 


13710 


X70775 


Chironomus
cingulatus


Sp 1 2 gene homologue


85 


38 


13711 


X80265 


Hordeum 
vulgare 


structural protein 


96 


41 


13712 


AB007922 


Homo sapiens 


KIAA0453 protein 


147 


64 


13713 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


124 


60 


13714 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


108 


65 


13715 


AF116661


Homo sapiens


PRO1438


120 


77 


13716 


AF118086


Homo sapiens


PRO1992


127 


64 


13717 


AK024455 


Homo sapiens 


FLJ00047 protein 


93 


56 


13718 


AF116661


Homo sapiens


PRO1438


137 


55 


13719 


W48352 


Homo sapiens 


Human breast cancer related protein 

BCFLT1.


93 


59 


13720 


AF119851 


Homo sapiens 


PRO1722


94 


58 


13721 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO:
6619. 


124 


45 


13722 


Y27868 


Homo sapiens


Human secreted protein encoded by 
gene No. 107. 


117 


58 


13723 


AF161361 


Homo sapiens 


HSPC098 


115 


50 


13724 


Y95829 


Homo sapiens 


Native human Tie receptor signal 
peptide. 


108 


100 


13725 


AF118086


Homo sapiens


PRO1992


166 


75 


13726 


AF116636


Homo sapiens


PRO1488


95 


70 


13727 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


70 


13728 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


149 


60 


13730 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4754. 


118 


63 


13731 


U33547 


Homo sapiens 


MHC class II antigen 


123 


69 


13732 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


135 


88 


13733 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


138 


75 


13734 


AF026204 


Caenorhabditis 
elegans 


C30E1.1 gene product 


102 


51 


13735 


Y12950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


117 


63 


13736 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


112 


87 


13737 


AL359782


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


115 


58 


13738 


AF130089 


Homo sapiens 


PRO2550 


122 


70 


13739 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


124 


74 


13740 


AF130051 


Homo sapiens 


PRO0898 


85 


70 





13741 


AF093748 


Homo sapiens 


KH type splicing regulatory protein; 
KSRP 


91 


51 


13742 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


150 


77 


13743 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


124 


82 


13744 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


91 


48 


13745 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619.


116 


64 


13746 


AB001684 


Chlorella 
vulgaris 


ORF54d 


70 


66 


13747 


Y02886 


Homo sapiens 


Fragment of human secreted protein
encoded by gene 90. 


140 


60 


13748 


AF090901 


Homo sapiens 


PRO0195 


92 


39 


13749 


AF090895 


Homo sapiens 


PRO0117 


148 


67 


13750 


AF119882


Homo sapiens


PRO2492


91 


45 


13751 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


60 


13752 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


129 


65 


13753 


AF161356 


Homo sapiens 


HSPC093 


99 


48 


13754 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


124 


70 


13755 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478. 


120 


51 


13756 


AF130089


Homo sapiens 


PRO2550 


142 


96 


13757 


AF130089 


Homo sapiens 


PRO2550 


132 


86 


13758 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO:
6613. 


163 


50 


13759 


Y19609


Homo sapiens


SEQ ID NO 327 from WO9922243.


92 


60 


13760 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


154 


52 


13761 


U62040 


Elephantulus 
edwardii 


reverse transcriptase 


134 


51 


13762 


AB001684 


Chlorella 

vulgaris 


ORF49b 


100 


45 


13763 


AF118082


Homo sapiens


PRO1902


129 


50 


13764 


AF220264 


Homo sapiens 


MOST-1 


129 


66 


13765 


R59842 


Homo sapiens 


ApoE4Ll protease. 


135 


50 


13766 


AF220264 


Homo sapiens 


MOST-1 


115 


71 


13767 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


139 


73 


13768 


AF130089 


Homo sapiens 


PRO2550 


97 


34 


13769 


U80739 


Homo sapiens 


CAGH26 


564 


100 


13770 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


55 


13771 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phage clone lambda HC2533.


102 


68 


13772 


AE004507 


Pseudomonas 
aeruginosa 


hypothetical protein of bacteriophage 
Pf1


93 


44 


13773 


AB044885 


Canis familiaris 


dopamine receptor D4 


78 


52 


13774 


AF026246 


Homo sapiens 


HERV-E envelope glycoprotein 


108 


53 


13775 


S71805 


Homo sapiens 


RNA-binding protein^TLS/FUS-ERG 


126 


100 


13776 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163.


156 


82 


13777 


AB015727 


Mus musculus 


truncated granzyme M 


86 


37 


13778 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO:
6913.


74


52






13779 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


137 


61 


13780 


M14123 


Homo sapiens 


neutral protease large subunit 


246 


55 


13781 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


99 


67 


13782 


AF132972


Homo sapiens


CGI-38 protein


902 


99 


13783 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


147 


63 


13784 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


68 


13785 


AF130051 


Homo sapiens 


PRO0898 


138 


72 


13786 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


95 


13787 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1051.


92 


53 


13788 


U12206 


Homo sapiens 


unknown 


84 


42 


13789 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


118 


58 


13790 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


50 


13791 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


137 


76 


13792 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


104 


72 


13793 


AB006006 


Bos taurus


neurocalcin alpha 


1000 


100 


13794 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


183 


95 


13795 


W34499 


Homo sapiens 


Obesity receptor C protein. 


120 


85 


13796 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear RNA-binding
protein) 


127 


67 


13797 


U41038


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


182 


83 


13798 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


112 


71 


13799 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


91 


84 


13800 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


67 


13801 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


154 


72 


13802 


AF119900


Homo sapiens


PRO2822


131 


50 


13803 


AK024435 


Homo sapiens 


FLJ00025 protein 


117 


95 


13804 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


53 


13805 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


48 


13806 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO:
4478. 


98 


66 


13807 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


98 


66 


13808 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


150 


54 


13809 


M37679 


Mus musculus 


Ig heavy chain precursor 


70 


100 


13810 


AL451015 


Neurospora 
crassa 


putative protein 


96 


55 


13811 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


88 


77 


13812 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


71 


13813 


AF130089 


Homo sapiens 


PRO2550 


93 


75 


13814 


K01664


Drosophila
melanogaster


Bkm-like protein


119 


60 


13815 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


64 


13816 


AJ011435 


Blackstonia 
imperfoliata 


maturase 


93 


48 


13817 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


130 


76 


13818 


AF119851


Homo sapiens


PRO1722


141 


50 


13819 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


74 


60 


13820 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


76 


72 


13821 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


135 


50 


13822 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


95 


52 


13823 


L10908 


Mus musculus 


Gcapl gene product 


103 


50 


13824 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


146 


62 


13825 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


125 


83 


13826 


L10908 


Mus musculus 


Gcapl gene product 


96 


43 


13827 


G02971 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7052. 


102 


64 


13828 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


135 


53 


13829 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


70 


13831 


AP000060 


Aeropyrum 
pernix


101aa long hypothetical protein


79 


50 


13832 


AF119900


Homo sapiens


PRO2822


94 


46 


13833 


Y19767


Homo sapiens


SEQ ID NO 485 from WO9922243.


145 


40 


13834 


AF218028


Homo sapiens 


unknown 


117 


57 


13835 


L10908


Mus musculus 


Gcapl gene product 


79 


45 


13836 


Z26876 


Homo sapiens 


ribosomal protein 


129 


93 


13837 


Y36495 


Homo sapiens 


Fragment of human secreted protein, 
encoded by gene 27. 


124 


76 


13838 


AF044311 


Homo sapiens 


gamma-synuclein 


603 


99 


13839 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


100 


82 


13840 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


154 


46 


13841 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


64 


13842 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


92 


62 


13843 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear RNA-binding
protein) 


93 


60 


13844 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


63 


13845 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


145 


79 


13846 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871.


102 


50 


13847 


G03469 


Homo sapiens 


Human secreted protein, SEQ ID NO:
7550. 


101 


68 


13848 


G03240 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7321. 


107 


43 


13849 


U05313 


Trypanosoma 
brucei 


CR3 


98 


40 


13850 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear RNA-binding
protein) 


100 


65 


13851 


AF159055 


Homo sapiens 


leucine zipper-like protein 


103 


55 


13852 


R59842 


Homo sapiens 


ApoE4Ll protease. 


99 


71 


13853 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18
clone HCACJ81. 


142 


65 


13854 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO:
7795. 


119 


79 


13855 


R59842 


Homo sapiens 


ApoE4Ll protease. 


93 


69 


13856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


114 


76 


13857 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


94 


57 


13858 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


105 


58 


13859 


AF289022 


Homo sapiens 


formiminotransferase cyclodeaminase 
form C 


467 


100 


13860 


AF078844 


Homo sapiens 


hqp0376 protein 


488 


100 


13861 


AB032436 


Homo sapiens 


brain-specific Na-dependent inorganic 
phosphate cotransporter


2968 


100 


13862 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


91 


72 


13863 


AF130079 


Homo sapiens 


PRO2852


131 


63 


13864 


Z29701_cd1


Homo sapiens 


29-MAY-1998 Wild-type human c-Src 
tyrosine kinase cDNA. 


2380 


100 


13865 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


80 


33 


13866 


AL050318 


Homo sapiens 


dJ977B1.5 (myosin regulatory light 
chain 2, smooth muscle isoform) 


904 


100 


13867 


U37690 


Homo sapiens 


RNA polymerase II subunit 


358 


100 


13868 


X13923 


Homo sapiens 


cytochrome c oxidase subunit VIb (AA 
1-86) 


491 


100 


13869


L13848 


Homo sapiens 


RNA helicase A 


6669 


99 


13870 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


133 


46 


13871


K02064 


Bos taurus 


cytochrome c oxidase subunit IV 
precursor EC 1.9.3.1 


96 


72 


13872 


D31763


Homo sapiens


ha0946 protein is Kruppel-related.


4606 


96 


13873 


Z14014 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


83 


43 


13874 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


117 


70 


13875 


Z52203_cd1


Homo sapiens


17-SEP-1998 Human PRO217 protein
encoding cDNA, UNQ191. 


2135 


99 


13876 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


106 


64 


13877 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


143 


50 


13878 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


104 


60 


13879 


L25404 


Brassica napus 


cyclin 


124 


42 


13880 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


139 


59 


13881 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


70 


64 


13882 


AB000098 


Rattus 

norvegicus 


MIPP65 


905 


50 


13883 


Z95114 


Homo sapiens 


bK212A2.1 (TNF-inducible protein 
CG12-1 (similar to apolipoprotein L))


1639 


100 


13884 


AF132984 


Homo sapiens 


Nuclear pore complex interacting 
protein NPIP 


551 


82 


13885 


AF121862 


Homo sapiens 


Sorting nexin 13 


1453 


99 


13886 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


83 


69 


13887 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


109 


77 


13888 


AF217197 


Homo sapiens 


FBP interacting repressor


2725 


99 


13889 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


187 


94 


13890 


AF118086


Homo sapiens


PRO1992


85 


62 


13891 


M58664 


Homo sapiens 


Signal transducer CD24 


371 


97 


13892 


AK023443 


Homo sapiens 


Unnamed protein product 


125 


45 


13893 


AB040972 


Homo sapiens 


KIAA1539 protein


2271 


99 


13894 


U03750 


Escherichia coli 


DeaD 


95 


48 


13895 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


73 


53 


13896 


AF078851 


Homo sapiens 


Secretogranin III 


2384 


99 


13897 


U62039 


Elephantulus 
edwardii 


Reverse transcriptase 


109 


48 


13898 


X07816 


Human 
herpesvirus 4 


Epitope C13 (57 AA) 


55 


53 


13899 


AF116661


Homo sapiens


PRO1438


146 


48 


13900 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


77 


60 


13901 


Y28643 


Homo sapiens 


Human serine protease inhibitor from
cDNA clone HETDK50. 


2191 


100 
TABLES



SEQ ID
NO: of
nucleotide
sequence


SEQ ID
NO: of
peptide
sequence


Method


SEQ ID
NO: in
USSN
09/515,126


Predicted
beginning
nucleotide
location
corresponding
to first
amino acid
residue of
peptide
sequence


Predicted
end
nucleotide
location
corresponding
to last amino
acid
residue of
peptide
sequence


Amino acid sequence (A=Alanine,
C=Cysteine, D=Aspartic Acid,
E=Glutamic Acid, F=Phenylalanine,
G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine,
N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop codon, /=possible
nucleotide deletion, \=possible
nucleotide insertion)


1 


13902 


A 


J 


114 


434 


AIFKCVEGMFRIAMVNVCFVSSGSLIil * 
PLT Y/GVYDEWTHFAYMTIDLLE I PITG 
SHPWLNAIiFCLEAP \ WISPNTGS FAYP 
VYPKSIilAHDFAVEATMPYIRLSST 


2 


13903 


A 


2 


124 


466 


KSNIPNLGDCGWESLFNRXQSWRSSLAV 
\NDTYSSKKSNAETFTFHADI»CTIiSDKD 
RP I TIQTALAELVKHKPKATYEQLIAVL 
DEF/ANFLKKWWKAYDKENLFCEEG*KL 
CAASN 


3 


13904 


A 


4 


1 


427 


EGFLELIiRTRNHSNS QLQLTTG I GLFLN 
EGIiKLVDKFIaEDV* K* YHSETFTVNFSD 
TE * AMKHI WD YVEKGTQGKIVDIjVKELD 
RDTVFDLANYIFFKGKWDRPFEVNDTEB 
EDFHVDQVSTVIsFEPIMKIiLSXMIilsrrHPC 
FKL 


4 


13905 


A 


5 


1 


464 


KI KSF YAS KDTIKRMRVTDWKKI FAY* I 
SDKELIFTIiRTLKT* *K*GKQPNLKISrGQ 
E 1 * VP I S PQD IQ IAHK\HLEGWSTSLVS 
ELAPCEAPV/RHPLTGLTIAGLQGFGEA 
GRIiVRGRWGC*W/VHPF*KYI /WQFLSK 
LHISLPYDPTTPLLGTCSR 


5 


13906 


A 


6 


308 


3 


HF VIHS KHDIoAIAHLGI Y/ PREMKT* VH 
TKTCT* IFTVALSVIARAWNQPGRPLCS 
EWIi\KYMVHTME * HS A I KRLNYRYKNNC 
VNIiFLGITLSEKSQTQNVI 


6 


13907 


A 


7 


587 


2 


FLTRETGDPTGRSSSHANTQSRFFPDDP 
PG\ PLNNLGNTHGCGRRAGRCPGTGPDG 
P \ AGCGGPRCWPSGHLAATGD * GP S CGR 
LGANRGEAGPAGFTACSPLSGCRTPYTH 
HFPASRMS CHLNCAS PRTYRSQGNRGGE 
RVAQGSQGAGGERGAKSQVPVPAPARNK 
DPAKCRKPRNRRPGNSGPWRAYRRQR 


7 


13908 


A 


8 


1 


474 


RI LNEEHGKYEGLHE * EVKWHLY I KS PA 
FTDLHLCYQKDMNGISTSASSPAVGTVG 
MDMDEDDDFSKWNFYYSPHSYPDK^LTI 
FKTESRVRESDEVTQIKVNWDEEVISGL 
LTSLKDNVLKATGVL YDYAYK\ YLCEHT 
RSTLKEESLKLERNIiQNH 


8 


13909 


A 


9 


3 


539 


SQ CSPF I S PACSLTAIiEEETEALRVHPR 
LCLSPNIAPSSGPPRPPEIiAPCPPSSQA 
GLRTCHSWVKGLiHQPLPVASGMKSTFCniT 
KTYTCPYPPP/PPLCPNHSPNAIiTIjPDS 
VTHAVPFE /L * SPSAPPSSTA* IIiGS PS 
\CGASPCNHPHSHPGICPTPPGLWPVCP 
CAPRAWQRIJGTRQT 


9 


13910 


A 


10 


2 


453 


RL. * LGLE YALLVWGTPKV*H*GGFP 1 YY 
YI VLLLS YALHQVTE YSMYVS IMAPNAK 
VSDPLIVGTYMTLLNTVSNLIGNWPSTV 
SLWLVNPLTVKECV* TS YQNCCTPDAAE 
LCKKLiGGS CVTALDGYYVES 1 1 CVS I AF 
V/W*VFLVHKFK 


10 


13911 


A 


11 


20 


475 


KMGVPPLLMSDPNRFLFPKNFLREKTIS 
PPKTF*PIiKIWVKGQWLNFLGFPGFKI 
FFPVFKFFFFFFFF/RDRVSLYHPGWSA 
VSQSELTAALTSPGSGDQVILPSQPPK* 
ENHLNLGGRGCSEPRLPRAEFIiDLRSFS 
IIVIFP 




13912 


A 


12 


392 


63 


HIRADPGL/EPRPSARTGLGPELGCCTMW 
KLGDAGSATQSLGGSQLSWQLSRREQEL 
EQARWEAQ^QVETLGRVAREKEALAKEH 
AGLAVQLAA\AEHEGRTVSEEATHLQ 


12 


13913 


A 


13 


18 


338 


aphpqylqmppmllpprtgpgqfslps s 
rhgghle/gkehetsvtlcggepppqtp 
prdpdpgp*arrapcprrpt*ahprals 
raapqepralagprarhp i cpgsl 


13 


13914 


A 


14 


2 


371 


TTKQ* KDNP IQTGAKDLKRHFCKEDMQM 
ASNHT\KRSLTSIiVSREMHIKTTRMAGI 
KKSDNNKHW*GRGE IRT/LI»RCWWDCKV 
VQLLWKTVWQFL * GIjYQYI PHDPVISRL 
GISLILYIALRTF 


14 


13915 


A 


15 


443 


1 


SRTFLDTDMKSMRGFKAS*NRQTLLLGA 
NAAGD\KIiKAMIiTNHSENRKILQNYVKC 
TLPMLYKWNNKAWMTAYLFTTQCTEYFK 
PTVETHCSEKKIS F \ KILLTI YNAPGHL 
RALMQIYKEIHWFVPVNIPSILQPIVQ 
NVISTFKS 


15 


13916 


A 


16 


375 


38 


HQQWGFLKKTDPTIiliCLjQETHFR/ CKDT 
KRLKVRRY/ QPNSNQKRAR/VPILIEDK 
IDFKTKKIPMMIKDLTIINI*ASNTRVP 
KSMKQKLAGLKEEMDNSVIMVGGFSYPV 
SIRK 


16 


13917 


A 


17 


448 


189 


NRDRVSIiCCPGWS *TPGLKRSFHLGLPK 
CWDYRRDIY*LL\FARHTQMISTHS*QT 
TNTCS YPAF* KSRP * EPGYNTTHTPHS S 
DML 


17 


13918 


A 


18 


1 


426 


GMSHHARSIil INQLFKKCSTLFVL/REM 
QIKSLLASSSSSSSRNSVSGQGFETVGT 
HCGQKCiCLVQPF\WJ<AVW*yLIiKIiWFI 
IiFLIYNRNAHLEDTCENVDRALFVXVKN 
WKLSKYPPGVKWRIKLWYSHSVKSSTAV 
IiPKV 


18 


13919 


A 


19 


2 


423 


YFETFQPLLRRLGYQGTFFPP\KPWSSC 
LGIEYNSGPDSCA* FFLQNQIRLWSAN 
I RLMAMTLKTNQVAI AQFLE CKE SDQQF 
CIGVTHVKARTGWA*F*SAQGCDIiLQNL 
QNVTQ/GAKI PLW* GDFNAEPTQEA* K 
HF 


19 


13920 


A 


20 


10 


443 


liKVDSGDSEVRYVFILQHITLLMCSAYM 
NQLLNI FVRPSLLAVALHMTPGFTKEDV 
YSCFRFLRDVFADEFIFLPGNTL*DFEE 
SCYLLCKSEAIQVTTKDILFTEKGNTVL 
* FLVGLFKPF/VESYHIICKSIiIiDEK*A 
PFIEEP 


20 


13921 


A 


21 


11 


426 


VL VETNNLRMGQVTM * PELPDM/ S PDAW 
TLSDSPSQKIGHAQQ\KYSIIKWKWYTE 
DWAQACIiEDTSKYEQVTQIPMAPNDATL 
/ Fbo/AJtiLiAc lAib'^GAPbUo^ljKlNTXitalj 
ICDGST\KPSSTN*KQWIALQSHAGLN 
MKEC 


21 


13922 


A 


22 


1147 


1768 


QLGMSHGYSSKSMPQKLMCFLPNHHLQK 
GHECLPKVLHSNPPPIIKYLALQDLMLL 
SQYSPSRRQEVFSLSQPGGHPHNWTAIS 
RECLNLLNGMTQKrilLYQBAAATNGRVS 
SSYPVBPKKLNSPEETAFQTPKSSQMPR 
PS VP PLVKTS LFS S KhS TPD WS PFGT P 
FGSS VMNRMAG I FDVNTCYGS * AT * HS E 
¥l\LiLl<IIjSl,L,YH\mhCGVFLhTTWYVSWl 
LFKIYATKAHVFPVQPPFAEGS*VPSKS 
VK*QSSP\LIKYLALQDLMLLSQYSPSR 
RQEVFS LS QPGGHPHNWTAI SRE CLNLL 
NGMTQKIiILYQEAAATNGRVSSSYPVEP 
KKLNSPEETAFQTPKSSQMPRPSVPPLV 
KTSLFS S KLS TPDWS PFGTPFGS S VMN 
RMAGl FDVNTCYGS PQSPQLIRRGPRLW 
TSAS 


22 


13923 


A 


23 


421 


3 


KIIFRAPKG*KSPSMPGFKA*KDRLILL 
LGTNADNDFKRKPM/ljrjyHSKNPRAIjKN 
Y I KS I Q P LYE * KKA* MTAHLLKWVTE Y 
FKPIVGTYC*KI/SFKILLLNDDAPG\H 
PKALIEMYREINIFMSANTLFILQPMEK 
GVX 


23 


13924 


A 


24 


226 


2 


THERTHSKXIHVIIIKITHPLNPSILRP 
QTTA*IKWRDLGSLQPLP/PG/LKRFSY 
IiILiPSSWDYRCPPIiRPANFCIF 


24 


13925 


A 


25 


2 


611 


FFFFLLLGLLHQIPDVSPTGKYTTLLiPI) 
MI ILMISGIKEVIY/DHK*HGRQNS/VR 
NTKLL * QDS WDTFKWKEVNVGDT VKASN 
GEIjLPADTVPMC\YIATSNPDRETN/ VK 
TRQALPETASV 


25 


13926 


A 


26 


1 


443 


atqwrpslvpasaenvwkarsfaagiha 

LGGTNINDAMLMAVQLLDSINQDELVT* 
GS VLLI ILLTDGEPTAWETNLMS IQNNV 
REAVRGRYSLFCLGLGFDVCYAFIiEKLA 
LDNGGLARRIHEDSDSALHMQDFYXQEV 
AHPLVTAV 


26 


13927 


A 


27 


2 


359 


KAQSKQWIiPRGGSGPVK/ARVDEESAKV 
IVRVYWEAQGILLIDFLKGQRRTIYAYS 
ENILRKPEL* *RNAWGSFSRVIjHHDRAP 
AHFSHQ/TRAIG/REF* *KTIRHPHy/S 
PNIiNPLDAFCF 


27 


13928 


A 


28 


2 


427 


WRKTVKGHFTDQCRKHKAM/LGN* ihfl 
EYHIHKHSIQYSGIQATQEKFGGLKTSY 
CSKD*YITVIjDLTLKISNKDVSN*HENP 
ISMYKN/ IPTVIYGSKVIKDRKLKA*!* 
L-t' / titilSixlS^r * Yi?JJx r x VJjijNifyii JPLiKLiW 
RIITEI 


28 


13929 


A 


29 


47 


412 


HCDVLLASSRYTCILPYSDRDDGPQDQL 

KMS vdfrsrrtgs /wfpqnswghmgvcg 
wggagrtldlihlripmrglrsggflcr 

K KJj V b li b 1 * ii o P ij * JK.* JC\.G<j 
LTTVPSQL 


29 


13930 


A 


30 


336 


419 


rlscsgmisaycnlslpglsdpsnsasr 


30 


13931 


A 


32 


1 


408 


KIALKLRSOTSKISGY/MANIQKLIIFL 
YTSNEQIEFEIKNTIPFTLAPPKIK*LG 
lJ>JJ_ilJ\L. \ V yiiJjjciiivJ. X Jvb VMiUN J-JviJiljiNJiv 

WRSSYGKGKSKIKI1KINK*MDIACSWL 
RRIiKIVKI S VLHNLI YRFNI VP IE I P 


31 


13932 


A 


33 


412 


2 


QGHLSLQKFL\LPFVQLCPAPRGGVYRG 
RQDSLSCGGLHPF*ASGLLCLPTQASAM 
VGTPPPASLLLCSS ILDCCASNERGSVG 
VEPPEPGTGHNLLVCRLLRPLEKRS IRW 
KSAVGVARFSRCRPSWLPIiARKKNSP 



32 


13933 


A 


35 


304 


33 


KVWGEKVWYWQKMTQIVQWDRTESPQID 
N*SLTKEIQWRKDSIjFNKW*GNNWTAPF 
SS/RS1iNLiNKDXjTAVTK1KSKWVTDLNV 
KHKTIKLL 


33 


13934 


A 


36 


2 


424 


SKTYS IGHFTYEGKGTITSLWGKVNVED 
AVGETLGRLLVEYPWTQRFFDSFGKLSS 
ASAIMGNPFViCAHGBKGLTSLGDALQHL 
DDLKG\TFAQLTELHCDKLHVDPENFFL 
LG * VLVTVLALHF/SRRFTP/ELQAS /Wr 
PKM 


34 


13935 


A 


37 


2 


433 


NKRLPGPGF*KRPJ!TPPGEGGPARYFNPL 
GGPNREIPL.GPEV*THFGPQMRPPVFLK 
ILKISGAFWGPPVGPPTWGG^EGLNPRG 
PGYN*P/R/PPAPPPWG*SQAPFPKPAP 
PP/TGIKPNP 


35 


13936 


A 


38 


3 


425 


GAAQLLPVSLP SAQRAIDLQ I LTTPWEN 
ms IAS LQHKTT I GRS WL I TG P I SATS S 
TPTSCTVASP\TRGHV*RSASWKYDVLQ 
HSSHMWWHTAVYIiGEAFHQVHATGS CH 
HRVLS CCPITGRSGRDAL* QliLiPL/HAL 
TEA 


36 


13937 


A 


39 


2 


463 


DNEVRDIKLSSDHWPSKTNSFIiHSPGFL 
srfepqpas va/prp * sqqqsltpgkas t 
SIiWPPNPVPFVTSSLSLSALPGIjFLWLP 
SIPPIiPGSPFPSPE*PEVPLFPGPryHN 
PSPPD / PTVLEAHPDQAPLP \PGVPTAE 
QEIPTPAP*AHRPS/LPIjPP 


37 


13938 


A 


40 


223 


408 


RNTV*NIGTDKDFMT*TPQAIATKAQID 
KWNL lEPMS FC/ T YRETI IGVTDRYTRD 
KIFESGR 


38 


13939 


A 


41 


3 


427 


NLKMKSRTSKENIGEHIYDFDIGRFVNT 
* SKPERKRIjSFKTSVQEKTP *L*GKHK* 
MKLEP\liPyPKINSKWIKDl?VRSKAIKL 
IiEENRSES S IWQRML/ SIMPKAQAI KEK 
VDNLDFFKIKNCVSKDTIKKALDRPSER 
EKI 


39 


13940 


A 


42 


30 


448 


FEMREIIVAFHS ILIRYHGLINLRKFQ* 

mdrryskevqdlletmklfarivgpl*h 

DKFIESHALEFELLREIKRLQEYRTAGI 
TNFCSARTYDHIiKKTR*EERLKRTMLSE 
VLQYZQDSNACQQWLRRETY\ lESGPNP 


40 


13941 


A 


43 


1 


928 


LRQRMPPQGLQPCPWPPRYP/PDSPLGC 
PGGIjA\ PPNSPS WPKLASLAGSRGSSAP 
PGVWGLVATGA*PLQRPSAPPPCPR/LC 

T) f~* "D r^TV/I T3 

Jr (jK 111 1^ V-iYl V- F 


41 


13942 


A 


44 


2 


409 


SKSPDPTQQPLRGGSLTHSAPGPSLSQP 
LAQLTPPAFGPSTSSLFNLQKSSLSARH 
PQRKRRGGPSEPTPGSRPQDATVHPACQ 
I FPHYTP \ VWHILGPQRHTP * S VD /HPG 
ijDKRJjJjPlij i PCjPL. 1 bNbyPVWJ_jCXiTP 






A 




J 


1 JO 


J-itri'l ± JLjJr J. Ihljxli " /-I J. Xji\.r X nflMrl J. X X J.wrt.i\-ri 

IL»TNQK\NKARGITNIC 


43 


13944 


A 


46 


365 


2 


AWGSLRAPRSAVPEPGTPPVMGNALAHR 
QPSPCLDGLSCPSFLRG*NSPPHPGSPG 
LPKHA*APAAVSPWVDPDRTVQPPPPAP 
PST\D * PHCS PPCTPWCPRRLGSAPVMP 

GCPTASYPRR 


44 


13945 


A 


47 


1 


456 


aelselyeessdlqmd™pgegdlpkw\ 
EEATGTRAAI PWVPPATGAQQLEEEGPM 
ath \iiJiAyPiyiAAFilibr]5LKbijAJ>J(jPNA 
GQPSPGRRTSRAEDEA\EEFDDWEDDYD 
F\PREEPVKGARLRFLPPS * KTPPSFWE 
NRNTPLWGGLKIFY 


45 


13946 


A 


48 


116 


442 


PQNMGMERKPGPPKGLSWASGDLEDNLG 
VMSVGFLLSSPDDAVIWRGPRKTGLIKQ 
FLjRNGTGE KVD YFMGDT QLGT S D EHL S S 
VATWAKAHLGEPGAFTFPQRVSIjRE 


46 


13947 


A 


49 


434 


3 


FTVPGPLLTV I P I F I SKVP FKFS LPQ I F 
NFTPFGFPFQIRV^TIIPISLRFPRKKP 
LFPFF^NKVPLCSPYWNSIARSWVPAAS 
VFQV\KESFYLNLLSVPPQVPLNVFLTF 
FFFFFliGRDR/ SLPIjLPRMVWNS WAQAI 
IiPLWL 


47 


13948 


A 


50 


1 


482 


EKP YQCSDFGKAFNMKTQ \ L WHQGVHT 
GNNPYQCRECGKAFGRKEQLTAHLIAHA 
R*KPYGCSECGKAFSSKSYLVIHRRTHT 
GERPYECSSCDRAFCGKSQLI IHQITHS 
TENHYECNECENTYPRKASLKIHQKIDL 
GKKPF/ECNEWGKALLiK 


48 


13949 


A 


51 


1 


470 


REFLWQEGHRAFATVDEAGEEVIiQILDL 
YADVYEELiIiAIPVANGTKTDKDKLAGGD 
YTATIEAFICASGRAIQGGTSHLLGQNF 
SAMSE I VLEDPE I PGENQPAYANS WGLT 
TRTTWVWT I GHGDNMGL / LLT PRVA* VQ 
/MVl X PGGlPKAFhKRQKT 


49 


13950 


A 


52 


26 


448 


SPGTEREYRIGQQSVTGVTSVDDSNSYW 
RIRGKSATVCERGTPIK*GQPIRLirHVN 
TGRNLHSHHFTSPIiYGN/QVAIi*GDlVI 
IL*RRKQRriKGFTEEGIKLRPKEVSAFG 
DEGEGDYLDDWTVLCNGPxWVRDGEVRP 
NT 


50 


13951 


A 


53 


3 


495 


AMEVKAFABTHIRGFTLNDAAWSRIallT 
QVRRHYLKEAATTLKTVLDHQHTPSRIiA 
vtrviqalamkgdvXdni *VFHKMLNGL 
EDSIGLSKMDFINNIALAQI *nsnldas 
V*H2EWMliTS *NCirv ISPQyFGSAYJjFRQ 
VLEEQLEPTVEKISIMAERIiAKPLQ 


51 


13952 


A 


55 


1 


428 


QERGTKKEMEDRMTLEETKXEQILKtiEE 
KL,*ALQEEKHQLFLQLjKKVLHEEERRRR 
KSQSDLTTIiTSAAYQQSLTVHTGTHLLS 
VQGS PGGHNRPGTLMAADRAKQMYGPQV 
IjTTRRYVG I aaafagtpehgqfqgrpgg 
VYG 


52 


13953 


A 


56 


2 


453 


EDGDIiDAFSSDEGLTMAMSYLKDDIFRI 
YI TETQECRRYHRPPCAQE APClsrMVHP I 
VI CYGCYGPE VGTRYTCS VCPD YDLSSS 
CKGKGLHRGHTKLAFPS PFGHLS EGFSH 

SRWLRJWKHGIjIjRWS * WEMGPPGNWSPR 

r*A /GFIARLiGP 


53 


13954 


A 


57 


3 


435 


ELNTS I FRSRP lEGLGLNTVLTTDNSNL 
VI \NRIGI V\PSVTEKEYTDPSSDGTYA 
WKIFSHETITKAQILKLFLSYDYAVNNP 
WLAYPHYKSPEKCPS I ILHDRIiYYLNGL 
* CAAS AMVMIAIVTYNVAIiLAYHRWNGL 
TYMID 


54 


13955 


A 


59 


409 


1 


LCCEHRGKTVCVPRGKFTTLNAS INKER 
SKISVIiSFRI,MKLKKEYVK*K* \EKEKI 

MINHQKRENIQITDFRNERTELPPDSTD 
I KKAIRKY * K* F YPNI F YNPEEMNKF 


55 


13956 


A 


60 


1 


393 


(jJM V b o C^CjJJHF L-hitj * ijC P \ VlJKV MS * Cj X * 
VPEBACTQCIGEDGVHHQFLES VJVPVHQ 
PCHICTSLNGRKDNCTTQTCPTGKASTC 
GLCBIiARItRQNADQCCPENECVCDPESC 
DLPPVPN*ERGLQSTLTNP 


56 


13957 


A 


61 


250 


1 


K 111 IJ C KR V Ij z J^JJK V S 1j U L ACj P C is V 

KAASTFQ\VKHFSLQSSWVYRHAPPGLA 

NFFHFLQRQVXiTTPPGLVLNSWVHAI 


57 


13958 


A 


62 


365 


3 


IKKKLIWRLFTFPKPSWGKKTKKGIPFP 
* GP KKI P G X KFPQG X KKAFKGNR * TLGK 
GN*KKKKKKKECGPGGDIPCS*IG/RNI 
VKXLIIPKAIYSFNTIHIKIPKTFFTEI 
ETTILKFL*NH ^ 


58 


13959 


A 


63 


245 


558 


FLPTQVIS YVKRAIjAEGAQI * CGEGVDK 
L S Xi P ARNQAGYFMIiP TS I TD I KDE S C CM 
TEEIFGPMTCDVPFDSEEDVIERANNVK 
YGIaAATVW/SSNVARV 


59 


13960 


A 


64 


106 


532 


ERACQSGTSGQGGVPRRAIGAPR/E/DA 
FTGAVYIYHGDAGGIVPQYSMHPFA*SI, 
YP SGQ / S VAARGNLSGDSFNCHN * GLQG 
I,LIiASSV*RPVMLLITDPIIYTTKKr/ISG 
PJS.v>-riii 


60 


13961 


A 


65 


2 


548 


VQKYPQQNRSCVPPVAEWAVPQSSRLKY 
RQLFNSHDKTMSGHLTGPQARTILMQSS 
LPQAQIiASIWNLSDIDQDGKLTAEEFIIi 
AMHLIDVAMSGQPLPPVLPPEYIPPSF/ 
RRVRSCSGISVISSTSVDQRLPEEPVFR 
DEQQQIiEKKIiPGTFEDKKRENS * RGNLE 
IiEKRRQALLBQQRQG 


61 


13962 


A 


66 


72 


952 


S SRT Y TTS LPLLKKE VTRKIRKY F I IiNN 
N*NTTYHNIiWDtiTKWFSGKLIiF*RSR* 
E YRKDKI SDIiCFYIQMIiE/QRQVI KPKV 
STRKEINKSRTQ*SGKETMERPKLEDKP 

H*KDNKGYIYANNFNILDQMDKFFERYV 
TKMD\QEQ I VSUSTI P I ADKSNA* LNNFP 
SSSSSSSSSSSS 




13963 




67 


i3y 


4z2 


V jNVjiNjlJr'iiJr'Xj^ rLvjXoKnJin KJr 
KNKGGAKNQ * ASLGMKTPEAPAHSDKPR 
RRQHAAYSS*AMPFLGICISFSQCNL/C 
PPKLNV 


63 


13964 


A 


68 


274 


1 


NLKNKAVI TKTA*H * QKNTDQWNR I KNP 

NFNK*CWEN* ILTCKKMQLHP/SIiTTNT 

KLNSQWIKDL 


64 


13965 


A 


69 


22 


419 


KI IGEDGDT/ PLSEMAKSQRQ I FSKDLV 

TTT.nQTTKrn^mT TnTVK"T.WWP\ TCWTWTP 
jxxjl^o X j.iN\^vi> J. J.LJ J, X JXJjniriir \jxvy j. v x x 

FSSH*GTFSKIDHILLDKTYPNNL*-RTa 

IMQHMLSD I KVFKLE I STRKVTEKS *HT 

MSIiCNKTLEHPSGNEDASAYLK • 


65 


13966 


A 


70 


57 


423 


KDKNSQVTSEEEEQEGKIKENLNSWRDT 
R / CH/mGR \NIVRLS FLPKL IFIFNTV 
PIKIRTQFFMKLNKPVLKFIY*TKRTRI 
AKSLLKKK/RQBGKISPTNRRTYYEAYS 
TLESIGASMPSLDLG 


66 


13967 


A 


71 


394 


1 


KHFRCMLQEDCRIIililiWFFSETGSPCAV 
SPRIECSSKIIVHSLiKLLGSSDPRASLS 
* VAGTI GVHHHTQP * GI / CNI * GSGRRR 
SEKQWIKEKNVSSEAKRRRTF*NE*SV* 
KGDYNKDQGLRLGTVAHAYN 


67 


13968 


A 


72 


485 


1707 


SALGLSQPRPPGASDAAPDSTPAGV*TA 
SSSAPAGEKDANQMEWARRDPGPLHQAA 
PAPH* SHLVRDKRAQERLRKTSRGPPHS 
HRSGPVDLSPRSSHSRDSTPVHGPYFRS 
APDPRGSAPAFRGP lATRGR/RPRAPRA 
TASHVPLSSSSWKEPEEATAPRF^ASAC 
MKWRLEAGCPGHRGVWPGPPSRRLFMWA 
TATGEKS TP S P FTG VWRFG VE ACT* AGS 
PTAHIQHSQRTWQPVPPCTNGSRSPWTS 
SYPKNQSKVGTVPKRGWACQPIPETAVR 
QAT I SQ * ARKGAARPHHQARQVEHSTQ/ 
QRPSGDSGKFDQRSLCEASTLLQPERPE 
VQGKICKFRV/GERRRTASPNSAVPEKR 
TRTQTMVGTAF I GCS GKWRRVYS VRR / R 
SSPRIPPPGVGSS\PLKTPPPP 


68 


13969 


A 


73 


1 


462 


QDHRSKSSHSNKRPSLASSLSENFFQRA 
AKPLQVNKWKKLYSTPLLAI PTCMGFGV 
HQDKYRFL VLP S LGRSLQS ALDVS PKHV 
LTKRS VLKVACRLLNALKS SLKMS WVYG 
RVT * KYLL* F/ YRTTNRGLL * SMGFAFR 
YLPHGDNPAHLERGN 


69 


13970 


A 


74 


307 


436 


LP*VGCITEGIPEQDMVRGKHKLLRQEK 
TRLIGPNCPGVINPGECKIGIMPGHIHK 
KGRIGIASRSGTLTYEAVHQTTQVGLGQ 
ibJjL.V(jrJ.LrL»iyc'i?JNLjJ. £ir ±U<^Lxili±r vWUciA 
TEGI ILIGENGA/WAEDNAAECL 


70 


13971 


A 


75 


1 


435 


EISDSKAQLAAMAIilXDTWBjRMlfCFSE* 

NHEPLRTHCALAASKLLKKPD\QAE\RE 

JtlLiL.libJji XtvoCjTlJNliyxLNoCiifij^rlv^^ 

RLKKALIXAHQGMDPSLQYRVFIEILNR 

YIYFYEKENDAVTIQVLNQLIQKiriBDL 

PNLESWK 


71 


13972 


A 


76 


58 


366 


EFPDLVKDMNLHIQEAQCIPNKINLNKL 
MNRHRI I /RLLNTTTKKRILKAAKQK*N 

S I * KI LK/ EKTQNCLPRVFCPVKI 


72 


13973 


A 


77 


1 


445 


YHETGCFLMGAIVDITLTFNTYVHFQGK 
MKGFSLLADPQEFWVDNSTSMSAPMLSG ' 
MGTFQHWSDIH\DNLSVTHVPFTDSACL 

liijXyir'ilx Ar UijJJixV iiitjiji. r iriyjNJbijJN WMK 
KLS SRT IHLTMPQLALQGS YDL * DLLDQ 
A*LSDILTP 


73 


13974 


A 


78 


53 


444 


ERGGYGAGPVAWQFLVPSTAPMLQSPPL 
GFAIDHTPPVPAPAN*APCPLPYAA*RT 

STPWLPDAFPCPLPTWD/RPPCLQPLLS 
SLPTPRLPFLCFLLLAPYAP 


74 


13975 


A 


79 


1 


353 


HIRTELDYYGLTWIIYSDT*EAYNYIY 
IMVT*NVYKPQLWNI FDRATMHSQDVRH 
HLLCIRLMLKNPKHHAV* /VLNGHYAFV 
SR/SPKHALVQYVQAFRTHPDEPLYSFC 
XGLTFIHM 


75 


13976 


A 


80 


1 


350 


Uxr xAJjijr Lit=>b(j/ CVAx vAIRUJMUC-lIjD 
ERFGSYCPTTCGIAYFIiSTYHIKVDKDL 
QTLEDILHQVEDKTS*FKQLIKAIQLTY 
NPDDSLKPNMIDAATLKSMKML* *IMTY 
EASILT 


76 


13977 


A 


81 


76 


386 


PAYPQVRGPAS TP AS CIRP TNARVLSTT 
PRGKS VAEAHS VS PS AHRGVTS VIKLWS 
APCRLH* YGA\KVRPNS \GCTP 


77 


13978 


A 


82 


1 


360 


ES I LQED I TVLNVGAPNNRAS KYMRQ / M 
IiVELQRQ IDES III CEDFNTL I S EMDS T 
RPKI S KD I VGLNGT IN* LD 1 1 D I YRLL * 
PTTAEKTFFSSSRGAFT*MDHIIjCYETH 
lYKFKLHE 


78 


13979 


A 


83 


39 


344 


WGILEGGEYIWHVGGGKSLRASGGLWSQ 
PPRQPDDSRPLVLVPVCHIiLPHWIGPTD 
LGHKRQ W / MG PGAVAHVCNP S TLGG * GE 
WIT*GREFETSLANMVKPC 


79 


13980 


A 


84 


1 


427 


QQAHIiAGHHRIHTGEKP*KCEACDKVYR 

GKS sl/mk/hrrihtgekaykceechkv 

YSR/N/ SQTVKDRRIHTGEKP * KCKS \ C 
DKAFGHDSHLAQHARIHTGEKPYQCNGC 

gkafsrqstlvyrqaihgvgk:ly*gndc 

hkvfsijat 


80 


13981 


A 


85 


88 


307 


T WTQRRAKXjVRRI gwallp f prp sgs kp 
pp*tppalppcvp/pqsspwtppqppap 
dsre vskdwtqmrs fken 


81 


13982 


A 


86 


5 


373 


NSLIiNTHSRGPQASHCTFWBHERASSRD 
IiTGAESYGICRtiRRLISLS IVFSKSIHG 
IiACIS JLFLFLFIjFIjFFVKTGS wlccpg 
WPQAPELRQSSLLSIiliSSWNYRR* PPHQ 

lifXdfpcrn 


82 


13983 


A 


87 


2 


342 


VIKNEDHYIMIKCLI*QEDQTILNIiHSP 
NK™iNIYKPHMTIirLQKAVDKITirV*T 
CGTSLS 1 1 AVCRIjKIiVKLi/ VEDIiSNI IN 
KLDLM/DN"! Y KTLHLNIRD YTFLKHTWN 
lYKW 


83 


13984 


A 


88 


1 


338 


CNEPRSHHCTPAWRQS KTPS KKKKQKKK 
LTTS /CIKASMKSRVQG*LR/ CWAQVMG 
* *GGVLWIiFVK/REFFTLSINLSKEGHS 
RRVPCLGCLKKKKKKKKKKKKKKKKKKK 
KKFF 


84 


13985 


A 


89 


931 


2 


VASPNALPE*LLPVSGHTHDLERVGVAR 
LGQPHHTELTACRARMESRDTCPGVTLH 
P*PPSFPSSSSPSPGGPRTRLTHQGAGL 
EGSQGPLQSQNPSAA/PLGACRGGWEWP 
QGPGSGS *GG*LMLRELRE/WQEAAVQL 
PTPG*PS*ESGRPLPSSASGPTPPGSP/ 
SP*PGTQGLCGCHPSGLCATAAGPQDGS 
GPTPTPH\ PVQP S CRDSGPGQRLS PTP S 
LTSWPNTRPSPPTGPGPGPGGRATW/PG 
q /pQPOPESPSPOT.PCTTPGflFAn'H'PrP 

O / XT k-^ XT XV J_l J_ 1^ ^ ^ \^ X iT >^ J. -CiVJli XT iT 

AP\AAPSSVAGPIiP/PDGFPRAPASSGI 

TTAPSPGPDP 


85 


13986 


A 


90 


66 


464 


LOTYS S FPRLPLHSGAES PEGD I LWPGL 
TLFLSPSPSPLS\CHSHLSPTCRRPRES 
PRFSLCWRPHPPSKPPPG*VKRDCCPSP 
DPVRLSVSENPSAGPR/VFLRPP 


86 


13987 


A 


91 


197 


2 


NAEIIPAPQKVGPFIKIWVINFFFPFFF 
PFF * DRVLL CHPGWS AWQS * IiTVASNF 


87 


13988 


A 


92 


317 


2 


SRTPDFR* STRLGIiPKCWDYRCEPPCPA 
TSSF* *HLSLC*QGNL/WGPPDSIRiyiG 
AG\TRKTKCWEGCHFLFFLRRSL/NSVA 
QAGVQWRDLCPLQSLTPRFKRIiS 


88 


13989 


A 


93 


1163 


0 


FAFGFEM* YCSVAQAS \VQwYDIjTLLQP 
PLPGVQSDSPASSLPNKLGLQGTSHHAW 
LNFL 


89 


13990 


A 


94 


3 


363 


VLiWPCRPAGPSLGLNFPLYSWLRLQTFA 
AIRPGSTGRRLCLPNWVFTRNSLPFH*L 
VG/ CSSRHNTYJjQECTGHREPTYQIjNIH 
D I KLLFLRFAMEQS FSADTGGGGRESNI 

HLIPYIIHT 


90 


13991 


A 


95 


17 


353 


pepryvgfisggkvdikktwktqgrler 
tvyytgmyfiwchXvlkntdsswgtsp* 

* IQQHAGKRDNNyQIjIj\KLQG\QFSEAY 
TKCYSNPDSTHDVRKVYQDCPLLAPLND 
TS 


91 


13992 


A 


96 


3 


355 


ANYPRDLCKVTDEGGYTKQQIFNADQTA 
IjYWKKMPXSRTLMEREEKSVSGVQASKD 
RLTIiIiLGGS VAGD VK/ LKPMIi I YHSENP 
RIC* S YSLPVLCKWN/NKTWMTAHLFTT 
WFTECLNPL 


92 


13993 


A 


97 


367 


2 


S DHRARRGKGVGVRMGEkIjP S FQ S WRLR 
L/RRRKLCRRWGQGPRGN* Q*QKPDGGA 
RASQTDGGGERRRGVQTGSQAERDTSRE 
TYSPRGIQREIVRVSKGGKTGRQWQSQR 
QIQREKRVGR 


93 


13994 


A 


98 


115 


359 


LNFFFYLLNFMLLFHRYYALKVSYFKSS 
IjDRKLIjELIjWNKYWVNTLSSSSIiljTVS / 
DYTCKECI* ILSKLHSRLISETLFHRK 


94 


13995 


A 


99 


1 


490 


CVDPRVRTIFKKDKVGGLPLI.NLCIYYK 
AT VT * TVWYWHKD / RTDQ WDR I KS IjE IN 
PCI YGQL I FFNYYFFS Q* CQDS SMEGNS 
LFNKWC/CRPLTLYTRIGSKWIKYLM*E 
LKLYRLS LHDPEFGNGFLE I TSE VE I TK 
EEIGKLD\lJiLKLKTFC 


95 


13996 


A 


100 


195 


3 


SRCCKQRHIaVQWNRTEKPE / YGRLIFNK 
GVKTN* WGKNNIiFNKWYWDN* ISTCKRM 
NLVPYLSPYT 


96 


13997 


A 


101 


1 


371 


FSLIKISMMLLMKMEK*NLQFIWN/HRR 
IjQ I AKARLNKKlN KTEG I T Jj PD F K I Y YJCA 
WWYWHKKRHIDHWNRLENSNINRHICS 
QIjrLTKVPGANTKDHPFNEWSWEN\VCT 
KMKPDP/YLSSYTK 


97 


13998 




102 


6 


370 


KK.L3-1 XJr'JM Y i? *KJi^ iiJJhJ.blvP-lNl*JNl xAiKJlJN 

YI/PIS*NKKIL*KIiANKIQQHIKKKPD 
NSLFYKIQFGSILETFITINQISKPKEK 
NHI 1 1 STDAENAN FDKIQH 




13999 








z4 


LK*SFCIiSFPKCWDYRHEPLVPGLDQ/li 
FLESNRSVSVLKKNVP * WFSNSSGS ksg 
3SIHDAFCGIS 


99 


14000 


A 


104 


2 


352 


GT I ADFTQCCQAADKAAGLLPKLDELiP \ 
QNGRLRSAKHRLKCASLQKSGKRTPRAW 

T*AGLIHRFPIAEFAEVSNLPTELTTVH 
TECCHGDLLECADHRADLAKS ICENHDS 
XSSKh 


100 


14001 


A 


105 


139 


361 


KHTE AF * MP CAKY Y \LFGGLNQ I CALPE 
KTPVSDRKTKCCTESLVNRRPCFSAIiEV 
DETYVPKELNAETFTFHAK 


101 


14002 


A 


106 


3 


350 


ELEMIMLSEEVMLKAKIRQKLGLL/QPV 
S* WNAKEKFLKEIKSATPVUTQMI \KN 
SLIAD/MEWWIDDQTSHNVPLSQSLIQ 

s/k/altlfssmkaergeeareeklbgs 

rgwftrfker 


102 


14003 


A 


107 


225 


1 


QGINMDTVEYVGKGEPVRCWWKCKIiVAIi 
L*KTTWKLLRKLKI*\YDPAVPIjLDIHP 
KEIKSIYQRDSCISMFTA 


103 


14004 


A 


108 


390 


2 


LGYSGVRAPLEEAVCRFSDIiKLRAGRTT 
TLFKADRQGHLSLQRPLLTFDSLCP /AP 
RGGVYRGRQASLS CSGLHPVGAS WPRCL 
PTQASAMAGAPAAASLPPCSIilSDCCAS 
NE*GSVSVGPSEPGTGHN 


104 


14005 


A 


109 


304 


33 


KVWGEKVWYWQKMTQIVQWDRTESPQID 

SS/RSLNUSFKDLTAVTKIKSKWVTDLNV 
KHKTIiCLL 


105 


14006 


A 


110 


135 


1 


KQAILWPGAVAHA/WSCNPSTLGGPGGQ 
IT*GQEFEKSLTNMVKP 


106 


14007 


A 


111 


140 


338 


IMS II LYYGS I C YC Y YWL I D * Ii I LRWTL 
ALVAQAGVQWRELGSLQPLP / P CLSLS S 
SWDYRPP 


107 


14008 


A 


112 


26 


361 


RSFFWVFFFEKKSLLGIPGGKKGPPLN* 
LSPLAPGPKGPSGLNPLENGNWGPGPPC 
RGDFGFF*KKTPFPF\GPGGVKTPNPGE 
TF/EPKPPKGVGFP6ETPGPGPGENLYQ 
WK 


108 


14009 


A 


113 


11 


330 


asiwlhiflflsyflskgpryvaqagum- 
llgstgpls asraagttgalhctql* t y 
lllpkhgkavffXqetlllrsthpfplk 

HVYTPTYEIjLLVWDRDSLSRPR 


109 


14010 


A 


114 


350 


2 


GCCP CLIiFPGS PTVFGSLYS I RLVAFVR 
AVPPVWKAVPG* PPVKM* FFHLFI YLFL 
KQ/VSLWHPGWS*LiAQSQVSRFFPRSLP 
SNWBYRGAPPRLTLFFFFFKQSQGIEAM 
VliAL 


110 


14011 


A 


115 


3 


341 


RMVS IS * PRDPPAFASQRAGITGVSHRA 
RPVYSFLLLSWPVMDVPQDIYPFLVDG 
YIiSLPLVCCV*V*VIiIiCyPTWPPGLKRS 
SCLSLPSSWDYRHEPIiKPA/SCCVLLSS 
G 


111 


14012 


A 


116 


1 


384 


PS YPGDIiTEIMDEGGYPETQQI FSIDDTT 
FYWKKMPSRTFLTTEKSIS/GF*ASKNK 
LTFLLEANVAGDFKLKPVFTYHAIjRNPR 
TriKKYATSTIiPVI,CKWNNKA*VT/AHVF 
TTWFTEyPKLSVETCCSE 


112 


14013 


A 


117 


43 


347 


CAAGFGIiL * TPPRT * TRKPRRNGTYKP I 
SLMNINAKILKK/ILANRIQQHIKTKXIj 
LQSDQVGPI 


113 


14014 


A 


118 


351 


23 


FNK*CWHNWIFTCKGMNLEPYFTPNTKI 
lTAIKAIQIjI.EENKEVmjC*LGIiGNGF/S 
RI»PKAQMAKETIDiCLDFIKLKALCSK\N 
TIKKVKKSEELEKIPANHIHGTS IQNI 


114 


14015 


A 


119 


123 


340 


AADSSTHSLTKPAYLEKQNFFFFQMEYC 



L I PLAEGP WPNIiA* MQP P PTGFKQSS CL 
SLPSTWD*R/HWSPRLAN 


115 


14016 


A 


120 


1 


371 


PSyP*DLTEIMDEGGYPNQQIFSIHDTT 
FYWKKMPSTTFLTTEKSIS/GF*ASKNK 
LTFLL * ANVAGDFKLiKPVFT YHALRNPR 
TLTKyATSTLPVLCKWNNKA*VT/AHVF 
TTWLTEYFKLSVET 


116 


14017 




121 


82 


367 


YSKRSNTI VAGD / FTPLS ALDS S SRHR I 
NKKTSNCTIDPMDVIGIYTVFHTVSTEY 
TFFSSANGPFSKTDHLLGyKTSI/KTF*T 
KLiK*HQ/CIFSDHN 


117 


14018 


A 


122 


27 


253 


MKTENILGENIGEVPGVGKDSLDMSPKA 
* TIKEKNDKLPF I KVKNPS YS KHTI * KI 
KNQATVWQ/KLIiTVHKSDR 


118 


14019 


A 


123 


3 


305 


GTRQGCPIjSPL*FNTVLEILVR* ISQNK 
*INKT/srjTADP/MVrjHIENPKGSIK*V 
LELIWEFSQVAGYKINM/QKTVAFLYTN 
]Sr*LSICKEIKKTIQFIIASKRT 


119 


14020 


A 


124 


1 


327 


QVIPCLGLiPKCWDYRHEP/RMPGIiRGAD 
FPSLI * IliWEHMF * VMCILPHPCGDFWA 
MLNF*EREGMFF/CLiKRWKSHNVSQA/G 
PQTPRFKRFSRLTIiLSSWDHRHAPPHIiA 
K 


120 


14021 


A 


125 


74 


334 


IFEFGVLKVITCLSVSSHEVGKLiCPFFF 
FPKNGV* FCCPGLQGYNGS * L \ TLELLK 
QSSCLSIi*CTQDHRCLPPCIiABrFNLINS 
SIH 


121 


14022 


A 


126 


381 


60 


TVSQVLAHTCSPSYLKAKAGGSLEPGSG 
YPGCSEL*SCHCTPAWrT\SQTPSL*KI 
H*KKIIVKRAIVECVCVYVYTYIWLLIIi 
KFSDS IPFAQIWGFHEEFPYWJNTQC 


122 


14023 


A 


127 


323 


2 


RGFLGLSDE VSGQQLFS ILSE I ENQFKN 
NLEGCGGSCL/SIPKCWDYRCQPSRLAY 
A* L I F VFL VE \TKNTGFRHVGQAGLELB 
TS SDLP \ AliASQNAWIiTGI SHHAE Py 


123 


14024 


A 


128 


2 


354 


ENCQINNLRFHLRKPKKEEYTKPTASRR 

K* ivnirierngieqkXtiekinelksw 

LFENINKIDSHSGKWMVGEE*CVFVCVC 
VRERQSDRERETRIiIKI*NERGDITADL 
TEIK*LLILQKLNNY 


124 


14025 


A 


129 


74 


327 


GEIjAMLPRLILNS* \PCNPPA\SASIjVT 
GTTDTPLYPAHYYY \ I I I I I I I lEMEFH 
S\VAQAAMQWCDLGSLEPL\PPGFRRFS 
CLSL 


125 


14026 


A 


130 


83 


382 


YEFHASDGGSRLSSQHFGRPRWKDYLSL 
EVEGQPGQHSETPSLSLSriSF/CFIiEME 

SCSIAQAGWQWPYLGSLQPLPPRFRRF 

* CLSLPKCt'mYGHEPQL 


126 


14027 


A 


131 


749 


2 


RQ SLALS PRLECS S T I SAHHNFCLQGSS 
DS PASASRVAGXTGVHCHAKLI PVFLVE 
TRF\THVGQ\AGLELLTSSDLPALASQS 
SGITGMSHHVQARCVXF/CGPTFKIXQDM 
KY*NKKEKNQSTFLEHLQNQQENRHAAH 
TIiRHHTAS/l.KSSEFI,FAIRTSFIjIS*K 
RSGTTGMF*YN/WTIGNGVEDRFVLGPP 
FGIiGVQWYHHSLLQPQSPGLKQSSLFSL 
PSS *DYRHVPTPSFLKFL»RRWGrAILtiR 
L 


127 


14028 


A 


132 


371 


1 


KKNLNLNLTL * VKINTNWIMDLTIKYKI 
IT * KKQEKVGNLEQGKEFSDLTPKV* S I 
KGNTDK*DV*KPFSL*KPM*GNyRQATD 
*KKMSAKHVSNKYP\LSRIYKFNDKAKN 
KPIRKWAKQMN 


128 


14029 


A 


133 


2 


318 


NHKI IIjKDAKKAFDKM*HPFTMKILNK* 
GIEQYAT/DIIKAMCDRPTADI lUSTGEK 

LKAFPLRSGRRPLL\FH* IW-^VIiARTVR 

PRKRNKGSQEWVILI^ALIPSTLGG 


129 


14030 


A 


134 


3 


362 


WSELLGSSDPPALASQIAGI IDMKHYAQ 
PGTDF/STSVYFTLPTQIRIj*AIiKRKFL 
L/EFK* IFASI *QFAKCSCMSJiNPLPAQ 
PYHHWHFPNEETEAQGNFSLLWAC/LRE 


130 


14031 


A 


135 


12 


398 


KCSTS rVIREIQVKTREIYHYTPTKMAK 
IKKPDNAKC/WSGCGATKTLIHCW*!SrN\ 
SL.EIWI.FLMKFNI CLPCDPEIVPLDLYIj 
REMKT YIHENTYTKVS 1APLFK\QPKS P 
STGESINCSIIIPRMECII 


131 


14032 


A 


136 


309 


1 


QSEAPSQKNTKKQKTKVGGLTS * FQTYH 
NATVIKTVWH/W/YIGIDIDQWNSIQSP 
EINFHIHGK/LISHKAAKTNEWGKNSLF 
NK*HKDNRIFTCKRMKVDSQPHI 


132 


14033 


A 


137 


2 


251 


QCGKAFRAASVLRMHGRTHPEDKPYECK 
Q*GKAFRSASHL*MHGRTHNQEK/P/HE 
CKECGKPFRSAQNI,RIQ*RTQAHIRMHS 
G 


133 


14034 


A 


138 


396 


3 


LELLTS * SACLGLPKCWDYRREPPHPAK 
MIL*SKSS\L.PFLiCSKCLNTIWAL,LLLL 
CS L VPLLTIjPLKE F YAAH *SPDS LiKYP V 
LQKLS KLEMLiVLHETVQRKEQLTYSLYK 
PIFGFLVGYSFPLFCSYKTS 


134 


14035 


A 


139 


3 


384 


LDFIKMKNCCSSKELGGKYEKP*T/MEK 
MFAKHISNK*FIFRI*K*LSKLTKKK*A 
KDLNRYS LKED I QMANKCM / KKCS I * LV 
IREKQTiCNTMRYY*NGLS *RS /GQTK* * 
QGWGGTGPLIHCWWEYPMI 


135 


14036 


A 


140 


63 


388 


FMYLFIY*LER/CLTLAQTKVQWCDHSS 
LQPQTPGLKH\PPASASGVATTTIMPG* 
PLFFF * KRGPCWVAHARTT/ WLNLGSF 


136 


14037 


A 


141 


370 


3 


GPGHFPALFFKGPGLGPWPPI IPALLE 
G*GGGFPGP/RGSGPPGQPCFFIiKFQNF 
PGFV\GAPWPGFPGNLGRGTAL]vrPEAE 
GSINLKGPPGLQPGGKTKLFFQKKKKKY 
QHFQKTQLILCF 


137 


14038 


A 


142 


363 


2 


REKATEENFATS * DWLMMFKEGSHFHS I 
KVLGETPSADVETWSCPEDLI^KIMDED 

gytkqifnvgetXpsrtlitrkeksvpg 
fkasklqrig*lllgvnaagdfkiiqsvl 

mdysenhae 


138 


14039 


A 


143 


99 


375 


KNKNQNISPPLVRYTLLSPLLItPFIINS 
FFFPFFSF/CFF*KKSRFEPKVAFQGGN 
LS*LNPRPKGSKBFSCLSLQSTWITGGP 
PPHWADFDF 


139 


14040 


A 


144 


39 


399 


LQIIYNYIWLYIWLYIFWLVSKIYKELIi 
QXNNKETNKKTGQKT* IDVSSNJIMYMNG* 
*LHEKIHDIVIR*MRIKTTVTYHVMSTR 
MVIIKETENKyW*GYGE\KILIYCWWEN 
FEK/SVWPFL 


140 


14041 


A 


145 


358 


3 


REHTAEDKYEASRSWFPRFKEK\SPLRN 
1 1 VQGEAATTNGEVAVS Y/PEDIiAKNID 
EGGYTQQIF\NADETF*YWKMIiFRTFIA 
REKS \RLPSRDRLTIiIjIiGANAAGDFKLK 

PMFIYHSDSC 


141 


14042 


A 


146 


3 


376 


AVMGM YKIWVF VPTNTS I LQ P I D*GVI 
LTFRSY*IiINT\FCNTIGAIGSDSSDGS 
GQSKLKTFWKGPTIPKSXIKWTCDLWEE 
VKISTLMGVWKNVIPALIDLFERLKTLL 
EEVTAYMVNIGRRL 


142 


14043 


A 


147 


374 


161 


EFHRVSQDGliDLLTS * STCLSIjPKCWDY 
SKWQRRI*TQ/DLCDSED*SSIKPIQriC 
QELMS*iyiAEMAHGQP 


143 


14044 


A 


148 


373 


2 


FFFFFFFCDLLLFPKQPRSHASFSLPSV 
VPPPEMPSSSLTLFYIiFVYNLIjR*YL/N 
SVTQAGVQmjDKGShQPRP \ShhSSV!DH 
WCTTPCPAN* IFFL*RVGLAMLPRLLL]Sr 
SWAQAILPIiWSPK 


144 


14045 


A 


149 


3 


282 


ILKRI IHPDQVFF I PAMQGWFNI * K* IN 
TV\NHIIRIKGK\NRMIISLDTVKM'GK 
IQQPFMI\* KLGIKGNFLNLIKGTYENP 
QLAGHSGIjCIi 


145 


14046 


A 


150 


3 


360 


D 1 1 *ES KFQGHHTTGVQKGLQYGI ILFI 

i * evfffagffwafyhsslapmpelggh 
Xypptgifplkrlevp/llntsvlfasg 
vs vtsahhsiilegnrkqi i lals i titl 

GIYFTLLQI 


146 


14047 


A 


151 


349 


3 


kkssgshletksnriipgpsqgpwckeir 
kgslawqekswpfykpfllevgggrvir 

GSRMTRHGHhXRSSKThlKHHVPK*TVQ 

FNFCLF I flemesrs \vaqagvqwcnig 

SLQP 


147 


14048 


A 


152 


500 


0 


MPSRLECSGA/ ISDAQCNLQNSGAPSEP 
SSALSNPCG*DYRDTLPLPGYPIiKFFVK 
TRSRYVTPAGL*LIiASS\lPPTSA/FSK 

HWDYRHE 


148 


14049 


A 


153 


373 


2 


QAGFTLLTSSDPPASASXQSAGITGMNH 
RTWPGNF* PQKS CDS FVTKLMCTCHKNH 
IYAQSLVT*LGHSYIFF*DSLQPSPPRF 
KQFSCHSIiPSSWDHRHTPPCPARTQLHF 
GYEYFSRHFLRRR 


149 


14050 


A 


154 


332 


3 


THKFQRIiTSKSKKLKRKIiIFHQGAQKSN 
SliNPDI KKNLKRSQR3SfME*EKIFTDHLS 
SRIYRELLKLKAK/INLIKMGKGYFYEG 
DT*MADKHIKRCSISPIIREMQIKTTIR 


150 


14051 


A 


155 


337 


21 


liGLLPFFQRCPAKRGGI *RGSIiAAVALIi 
NCGELHPV*ASWWLCL\PVRGKIjPTEAS 

VMGDAPPPTKLECPRSTSDCCDGSICNFE 

SVDLSLLGSVGVGSVDLDPVAP 


151 


14052 


A 


156 


21 


491 


HPGSRGCSEPRSGHCTPAWGTiCVKTPAS 
TKTKQNKKKK*RTVCLTPWHTLL/RVWC 
DGGDYSSShh 


152 


14053 


A 


157 


3 


394 


REDLLSPGFQGCCEIi*IjCHCIPAWVT/S 
ETIiSQKKiCK 


153 


14054 


A 


158 


36 


354 


IiSAAFTSQLLGRLRQENGVN\CSEPRLH 
HCTPAWATE *DSVSKKKKGKPFWI PNPL 
SSLSPAFISGQGPWESHLKNPGRALQGG 
GFQPL I CPRPGRTHMSLKS PWRP 


154 


14055 


A 


159 


78 


340 


WFHKAQRGILhAEhYPSGSAGHSIEPK 
TI TLF FLGNKVLLCGPGWS AVA* S * S TV 
ASN\FGPK*IiLCLSPPSSSDSRY-ARPHiyi 
ANFK 


155 


14056 


A 


160 


2 


362 


HLSPLSIPQNRHCHHGPFSVSCWAHLPD 
GWAGQRGSSI,P/M*GRPGRGAPSPPRR 
GGWPGGGIiTPHLPSGRGGWPGRGTP \ P S 
Q*GRPGRGAPS PPGRGGWPGGGLTPPPP 
SQTGRLAGRR 


156 


14057 


A 


161 


293 


2 


PHLAIRPPTTVDKSALYWKKMPSRTYKV 
RQKSMPGFKSSKDRIjTLPLGAK\AAANP 
KLKPMLID\HSKNTKALKMYAKSILPVL 
CKWR\HKA*MSPHIiF 


157 


14058 


A 


162 


2 


136 


LIVP I L I AMAFLRIiTERKI LGY IQLRTG 
PNVWTPPTGATyRLLS /P * QP * * LFTKE 
P\LEPVTCTITLYITDPTIiAliTIALVL* 
TPLP I PNP VGNLNLGLLF I LATCSLAVY 
S IL* SNRNGI PKTYRTKNSRLYTTTHRP 
QRLDAAYWCYVQTPIA 


158 


14059 


A 


163 


1 


464 


RQGL/DSVTQAGVQGRNLSS \LQPL/ PP 
GLK* SSRFSVSNSWDYRCAPPHIiANFFN 
IiFFVQRG\ FTMLPKIi* TRS \GPGD 


159 


14060 


A 


164 


1 


353 


FNYSSSIjGNKSETPPQKKKKKKKGRGPG 

enpgtptfggakgggnpgikilkttgpk 
ggnpcfkkkpk*phrggrp/hgnpvyrg 
gpgkknfwppggggskg*kspikpppgg 

KKGDFF 


160 


14061 


A 


165 


45 


389 


FFVCTFCRERSLLCCRSWSGTPGLK* PS 
CLSLPKFSDYRHKWPCSARNFLPACLPA 
CLPPSIiPSFFPPS/SPSSSPRQGVTVIQ 
VRVQWRDYGSLQP*PPKTKRSFTIiAFQD 
LGPQ 


161 


14062 


A 


166 


330 


2 


WDYRCAPPHPVLYLKM/STPNIFFL*RG 
VTMLSRLVSSDPPISASQVARI IDVSHW 
AQL * RS V/ CYVFETGSGSLTQAGVQRYN 
HGSVQPQHSRLS *SSHPSI>WNYRYPPPC 


162 


14063 


A 


167 


2 


396 


YSNVLLG I YPKELKTLVHWMF I CV*NT C 
I WMFI TALFLA\ ES * CSS VGE * IGKTWY 
IPTMKYYSVMKRKAISSHEKTWTRLKCI 
LPSERSRCDRLQTV 


163 


14064 


A 


168 


486 


2 


lifgkiykinwssktdnkrgktqiqilg 
mktgdimtyXqqtskeyyahefnnleem 

DQLFKKHKRP/RIHQYEKHHLTGPMTIN 
E FE F INVKTPKNQS PGWDD F I GKF YQMF 
EEFLSENKTGELIL*S*YYSLTNSFYEA 
S IT Y* PKPDRQ * KKQSCGP I S PM 


164 


14065 


A 


169 


200 


3 


GRVDLPPTQESRPPGPPPSDPGVQAPRP 
S I VRNCG I LTRGS PGPD\P S PLS S * AQL 

GGSPPPPSFS 


165 


14066 


A 


170 


2 


327 


PGGIGCGELR *KHCTPTCA'JCERDS V/ WW 
KIIHYIIMXiASPNBLILPLriNICLEIL* 
AGPLTAICISMPiyiATL\FTlAQMGKQPK 
CPSTNEWIRKMWYIHIMBYYLHFKMK 


166 


14067 


A 


171 


2 


343 


PGWSQTPDFR* STCLGLPKCWD YRR/ AA 
TVPGLFIiFLTALC 


167 


14068 


A 


172 


367 


3 


FIRDVQF I S ALRYLLTPERMA/MI KKSK 
lSnSfRYRCGCGK/RG/TLLHCW*KCKLVQP 
L* * SVWRFLKELPLNSVI PLLGIYPK/ D 
KEVIYEKDSCILVFIAVQFAIA/RNIHP 
TYCPINE*IKKMWYrY 


168 


14069 


A 


173 


102 


345 


YQLQNIPRWVYSKLQNLSFAFVIiIj*RDR 
VFLCHLGWS S LL/ QLKRS FHL YILAS S WD 
YRQVTPRPAN* ILFRD/RGLALIiPR 


169 


14070 


A 


174 


22 


401 


STRLGLPECWDYRCSSSCPATYLPF*HS 
IPFLSQLKVQVHHRRYFHLFPEIiSNLPL 
LSKNVS *TY*KSFLS /WPSLRVLPCFVL 

P CF VTG FHS VQ ARVQ WGNHGS LQ PQP P S 

LK*SIHLiSLPSS*DCR 


170 


14071 


A 


175 


2 


388 


SDLQLRAGRTTALiFKAVRQGHLTLQRFIj 
LSF \ V* LCPAPRGGAYRGRQASLSCGGIi 
HPGRAYRLSCLPKQAWAMVGAPTPASLS 
SCSLISDCCASNHRDSVGVGPFEPCAGC 
NLTEHRFtiSPSGAWSC 


171 


14072 


A 


176 


334 


1 


KLLELIN* FNKVAVYKINTQNQLHCYTL 
IRTTQKGSLKTIIiLTIAPKRIKYLGI /Q 
/L.TKEVKDLHSESYTrj3:jKEIK*DINK*K 
DIPCS*MGMVFFGVCLFVYFCFEMAFRS 


172 


14073 


A 


177 


339 


2 


KNSPYNFPKGKNPGNIi*SLKSRIF/CWG 
STFAHIGELFFFKMGKKFPAI/RPFPFF 
EDRVLLCRPGCSGWRSWLIAAST/CLG 
WDYRCTPPCIAWFL *RQGLTMLPRrjLSN 
SGLQ 


173 


14074 


A 


178 


3 


303 


DLRRAACLILPKCWDYRHVPPRPADMSG 
FliHLI 1 1 * VCLPLLCHAQLVFIFPIETG 
FCHVAWAGLKFLGSSSPLTIiA/FSKCWD 

YR 


174 


14075 


A 


179 


342 


1 


INRLN I FTMAI FSTL 1 YRFNT I P I KI PA 
GFVEMDKLILKLVWKFKAHGIGKTTLKK 
NRVPNS KAY YWATVS KT\Y WNEDRMINQ 
WN*IDNLKINPCVCGQLIFNKDAKTISIEE 
R 


175 


14076 


A 


180 


2 


323 


STS YP I TGSHAFIi * PQNWDAETNS *H I 
NNVNLRLKI IKLLEENTEKNCHDLGLAT 
DYY/SVTPKA *ATTTKIDKhEL IKIKNF 
CTSKD IT* KVKRQL 1 GENSCKS FM 


176 


14077 


A 


181 


326 


3 


RKRKKKRREREPKKDSERKRLYGKDIIK 
KRRKDTM/DWEKILQKGVRQRMCr *KI * 
RGVTQELKANP I RKGGNNLNKVHQRI *M 
ANKHMKRCPKS*VIREI*IPTIMRYH 


177 


14078 


A 


182 


5 


326 


TKTIEKAAN* IRRSRRKKIKFKPEIKNR 
KTTTKINESKSCFYKKTNKTDKLIiVKLI 

rkkktkki rtkdeknhtl\*dctevkgm 
tecyeqvyankfdnsXekmdnfler 


178 


14079 


A 


183 


3 


378 


TVLCSFIVWGLDFPG* IHTI YS*VFCYK 
WNCVHNFIiFRPFSAYI * YLLI FVY*S * I 
MHIjCIiICY*LLL/CSCIFFWIFYMLYFS 
FWIi/ CFIiYF YF YIIiCPFI FF/ CI FI diL 
FP VYLPLLF I PPP I FLF 


179 


14080 


A 


184 


365 


3 


MQIiKCIi^lKS IS VQAG VQWHNPGSMQGPP 
PW\LHHFPASASLVTGTTWARHQGRVIF 
/ 1 PFFFLMRRSL/DS VDQAGAQWHDPGS 
IiQPLLPGFK*FA/ CPCWDDRRAPQCPAN 
FVFLA\RHGFTIIiAR 


180 


14081 


A 


185 


51 


293 


PYVNQEASVLTNSFYQAS/ I *KKSNYRP 
I SliMNIDAKIIJSTKTQQXKLNKVNDSRQT 
GEDQ I PKYCTVSDGRMMRDYFPLFS S 


181 


14082 


A 


186 


326 


3 


TKICQWILNLGKKREFGFKKKKKIKCPN 
KRHRLANG IKK* DPTTGCLQDI CFRS KD 
AQRLQI KGWKKI FQKTGI *R/GVAVLTS 
DNIEFKIKXVTRDRVANYIIKGSIHQ 


182 


14083 


A 


187 


17 


330 


KDNEM* IKRTJWRCYYTSFTMAKINLKIN 
NIKC*QG*GTTGT/L/LYC*WKHKIVQP 
FSKTV*QFL.MRSNKYIiRYNPAILLLGIY 
SN* VKI /C*KTYMRMFIVFLITKDWKQP 
NYSS 


183 


14084 


A 


188 


124 


357 


SLWGGDIiEGGGEKKERGBNPGGPKKPGK 
KKKKGGNLGARG/VFKKRGRPQKEKPGK 
KGEKGGP*GAGI *GGGGNGVKKK 


184 


14085 


A 


189 


333 


124 


RLRQENRLNP /GSRGCREPRSRHCTPGW 
ATE *DS VSKTNKKMYKS 1 1 ARDCHAENS 
IVSFLSKRGCVFLF 


185 


14086 


A 


190 


1 


327 


KSWLIGDNIPSSQSIilQKKGVNPSFKSM 
KADRGKEAAEEKSEASRSWFMREKERSH 
LHN/IK*VQGKAASYPEDLASI IDEGGY 
TKQ* IFNEDY/MWKKM*FRSFLTREKI» 


186 


14087 


A 


191 


53 


400 


VNILGPFHIjFFTFVQNALSPNIYMTHMXi 
TSLGPhBMSShQTGFPKPSYhKFQPSPB 
SVPVGF/IPGMQR* FSIHKSVNVI YHVI 
RiyiNDISHMIXSBCDT*KAFDKIQHXGSLF 
PALSL 


187 


14088 


A 


192 


65 


321 


RARTEIYLIHTLLGVYAGETKTGPYTDC 
M*MPTTAPTTLVKTWI*PKSLSIGERIN 
KL/WILICTYMEYYSAIKSVLTHLTTWV 

NPH 


188 


14089 


A 


193 


127 


241 


IIKNDSRRICNLTEKLPLRKINS/WPGA 
VAHACNPS TI^GG*GGESR I TYNESRNKG 
EQTQNNK 


189 


14090 


A 


194 


1 


337 


DLPAS AS QS AG IIG^ESIRN* NKLVKKT 
LKCSQKDPKEDLNK/WKDIVCSWLGRLS 
IiTKVSILPKIjIYKFAKXPTKIiYYD*KKN 
RPGAVAQAYNS S TLGGRGGQ I 


190 


14091 


A 


195 


2 


333 


NFNSLFFFVEIEKLILKFILNCK\VARR 
ILQRKNKVGRLTLPNFNTDYKATVIKTA 
WH*HKDTHMDQRNRTESTKINTYKGAKT 
I * WRKNSLFNK*OTDK/WJTTCKRKKVD 
P 


191 


14092 


A 


196 


299 


2 


PHPQRDWQPSPPSICRIFSPSSIPGIQ 
S WFNI QKS INVFYHI VI USFTQHRS SSS S 
SSSSSSSSSSY*LiyiIKF/IjNKLGIEGNF 
LILIKAIHGKPTVNH 


192 


14093 


A 


197 


2 


199 


IiALSPKLECTGAITAYCSLEIiLGSNDPP 
TLASKVP/GITGMSH*AQPQWFIiLLiYY 
FIFFFSSSSVL 


193 


14094 


A 


198 


226 


2 


KKFFFPNPNLKNFPLGRVFP/ CS PG*KF 
FAPLGPFIiVSFLRQKVPHLGSPGKTFFF 
PFFFGDRVWTCHPCWSAVA 


194 


14095 


A 


199 


338 


93 


PQHNGSCIj*SQLFERLRAEDPIjNLRVQG 
CNEL*LYIiCTPSWVlTPVSEKKS IQEKNA 
ITLDFREY/RDKRRENWTCV*NS lERSSE 


195 


14096 


A 


200 


1 


361 


PRSHLSQLRVRLSQ/ IKKNQKTIGKGFR 
AIG/MGKKF*GKTLKVPFTRGKVNQWKF 
LKLKSFLPRKKKTEITL*TLGKHFRKWE 
KFFATYPSTRE* ISKICRDLNHFTGKKI 
IIPIFQRAKGL 


196 


14097 


A 


201 


2 


358 


ENKTTMRYH * I P I RMAEN * VLAVGTL / L 
QPWWEYKRV^PLWKTNSLAVTLLNINLP 
CNPT ILLVGI YPRERKTYVYTVACTQMF 
LTVLFTVAPNWKQSQGPSTGE*KNK*WH 
IHIMGHYS 


197 


14098 


A 


202 


110 


357 


IWVFIWCHFLLPEKTSFFPGFPQCLDIS 
* KTLF VHS VLFFFKDRVLLCHPGWS AVA 
Q**FTAASD/FQESSCPSLPGSWDYKRM 


198 


14099 


A 


203 


2 


374 


IliKIiHAFFMAETPGACKTPCDIiRQVFI F 
IFIYLF/MRLSLTLVAQAGVQWRNLGSL 
QPP P PGFKRF *GRCLFYPKDGL VCPGLA 
GS*TGQIGBCGIRTLSPDLCLSDGPCGSK 
PKS/ASASQGHVG 


199 


14100 


A 


204 


375 


3 


LSSILIRGSFNLSTriITTQEHL*IjLIiPS 
W \PIiAI I * FISPLAETNRTPFDHTEGES 
SLVSCFNIEYAAGPFAILFIAEYTNIII 
INTLTTTIFLGTTYDALSHEIiYTTYFVT 
KTIPLTSPALMNS 


200 


14101 


A 


205 


393 


3 


SARLGLPKCWDYRR/ASTVPGI,\SLMLL 
LKSYVSKMKKCE*LVKPLKAICL*DSCKB 
Ii*VGNlSrLIMPSTYDQElSIDRVDSSriTWLC 
VIiLLLLLLIijDLIiFWRQSFALVARAGMRW 

RDLSSL*PIiPPGFRQFSCI 


201 


14102 


A 


206 


2 


377 


FRAVLQGRWSLQKFIiLPFVQIiCPAPKGG 
VYM * RQRS / SLS CGGLRPVRAS WLLCLP 
TQASAMVNAPPPARLLPPRS I SDCYTS S 
EQGSVSMGPAEPGVGYDLLVCGLLRPLE 
KHS I WVRVS CFSRY 


202 


14103 


A 


207 


327 


1 


I LTGNFKQ IRMIiI YHSVNPRALKNYAKS 
TIjPVNYKWNNKA*MTAHLFIAWPTGCFK 

ptvepycskkkkMpfkvlllidnapgh 
pralmkmpeeisivfmpanpasil 


203 


14104 


A 


208 


310 


2 


fcwwapgmgfplgewglilgpgspfflk 
egfpgvgqpgppkrwglkg*ppgahwkg 
f/ ifffadrvslchpcwsavaqsqltan 
pasqvqaidiipqppdyrir 


204 


14105 


A 


209 


374 


3 


gvrdp/leewcpf*dlklvarrt/slf 

FKAVRQGHLSLQRFLLPFWIaCPAPRGG 
VCRGTQASLSCSGLHPVRASWPLGLPTQ 
ASAMVGAPSPASLPPCSSISDCCASNER 
GSVGMGHSETDVGY 


205 


14106 


A 


210 


3 


196 


LENLKF/IiDKFLETYNLTRLNQEETEIL 
NRPMTSSKI * SVIKNLARAMAHAYNPST 

LGGKGRRIA 


206 


14107 


A 


211 


34 


339 


IIKSKRINYMSCELYPNINIDNFFFFET 

VERS VI \klkcygafmvhctlnlpgstn 

LSTSASQVSGAINRGYPGQF/ IGLEK*F 
LVKTGFCHVA*AGLEPLGSG 


207 


14108 


A 


212 


200 


312 


HYGQFHVLFCSLF*MESHSVT/QAG/VQ 
WP/ SLAIiQTPPPGFKQFSCFCIiVSSWDY 


208 


14109 


A 


213 


300 


124 


NHI PFYFQIHRVCVCVCVCVCVCLCVCV 
YRYIYCVSPW/RN* *SL*KECLSLVIjrjT 
PTC 


209 


14110 


A 


214 


1 


357 


QVDHTSDRKANLNMF* KIKVIQTMFSDY 
NGVKSENWRRKTGKLTITIQKLNST/L* I 
SNETNEKPQEIRKYFEVNINENTTYQDL 
WDEVSIVAQAHNPSTLEAEAGRLLEPRS 
LRSAWTTE 


210 


14111 


A 


215 


360 


1 


LTIETYCSE\KKTPFKILLFIDNLAGHP 
RALMEM* KSMNVFMPAKTTS IML PL I \ S 
S FNS YYLRTRIHRTFVWAQSWIiQCPTT 
AI SD I Q I GP VAMGIiLIjI PSAFVLCWIiS I 
QQVSKIiKH 


211 


14112 


A 


216 


3 


347 


SSRSRADGLFFYQCK* QTLYW*HLWYKT 
QKEKKKTQMKEKKQRINKAR\GDKLPEC 
EAVCGKPKNPANPVQRILVGHLDAKGS P 

PWQAKMGSHHNLTTGATL INEQWLLTTG 
KNLL 


212 


14113 


A 


217 


75 


347 


PPIFKRTARGKVTPGWEYPAKG/CGGTF 
IiGNRGLFPNPGFSGPPPKKQVWEGALCA 
P6*KVCTLKF* F I \ SLAI PGNGSKNFFT 
LFEGPSPIL 


213 


14114 


A 


218 


3 


397 


GGQGYSGTGHHGGI WHKMEH*R* PLPPK 
KKKKKKKKKKKKKKKKKKKGGP3KKKPR 
GGQKLKGGGKEKPPPQKGVKKKTLSGRI 
IKKDGREKHTRGELWKKTFIW/EGEKIG 
EKPPKKI*DHEGKKKVLRGKG 


214 


14115 


A 


219 


3 


401 


DS WATLHGNIiyiK* S AAVL * ALGL V\ FGF 
TVSGPTGI VLSNS * LDIELHDT YYWAH 
FHYVLSIGAVFAIIGGLXH* IPLFSGYT 
LDQTYAKIHFTI IFIGVNLTFFPQHFLD 
IiSGiyrPRRYSDYPDAYTT*NIIj 


215 


14116 


A 


220 


162 


3 


KPAQRNL*SNPEICEDINILKRNQS/WPG 
TVAHACNPSTLGGQGKQXIRSGV*SQP 


216 


14117 


A 


221 


374 


2 


WCRDRERERERATERERERERERQTDRQ 
TDRRREEA/EGWAAWAIN* GRAPGrSIiE 
AALECSCPRRPPQPAPPEQGPFPRTTAR 
GQPRPPKLLQPEAPSQTRPHGYPWPLRV 
LPQSGPEVRPRE 


217 


14118 


A 


222 


3 


264 


DHMRPKVRGCSEL * S CHCTPAWAT\SES 
LSQKKKKKKKKRKKIGEKI *GAFFKVAP 
PFFFFKKKMNLKSP VGIAGVAKNTRWI L 
KHRG 


218 


14119 


A 


223 


2 


410 


ATSP I JEEI, I T ^HDHAhlNI LhM FLGIi 
HALFVAI.TTTLTNTNI*HAEETQT* *TI 
LPAIILVLIRLPSIiR\lLYISDEVNDPS 
ITIESMGHQWY*TYEY\TDYSGLILNCY 
ILPPLFLEPGDLRLLDGDDQVALPI 


219 


14120 


A 


224 


399 


3 


GVGKPGFPWAKKWALPPYSKGYPK*VQN 
LILKVKLLDnKKNPGENSLEATGTii^GLP 
PKARAP KLKPNW /DS P KRKNPGAF KKKK 
KRQPTEWEKIFANHASDKRLTSRIYKEL 
QQ * KSKQP I FLIP I YHLNHFKG 


220 


14121 


A 


225 


360 


71 


WRTTWVPPKTPPLIiYHINTKKK*HTGNK 
KHKTKRAKKRKKTKNAKK/EKEKNPHPQ 

RPQPCRKK/EEQKPKRGKKKRGKKKKPFK 
EYSYL*K\YIHTNLQ 


221 


14122 


A 


226 


3 


386 


PS THVS INLAMGlPh *ADAVI 1GFR3KI 
KDALAHFIiPQGTPTPLI PILAI lETISL 
LIQPIALSVRIiTANITAGHLIiMHLrGST 
TLTVSTINLSSTLIIFTILILLT1L*IA 
VAIilQAY/VCPLLVGAY 


222 


14123 


A 


227 


3 


372 


YSLDSPSIiTRFFTFHFMLPFISAALAAL 
HLLPLHETGSNNPLGITSHSDKSTFHPY 
YTlRDAIiGIiLLFLLSLMTI,TLFSPDLtK3 
DPDNYTGSRNAPVKQPRPHI \KPE*YFL 
FA/YTIhUPPYKh 


223 


14124 


A 


228 


2 


263 


PRVRSRWEDCLRPGA*DQ/LCAP*RDCF 
FKKKKKKKKKKKKKKKKKKKKKBCKKKKK 
KKKKKKKKKKKKKKKKKKKGGPFKKKPR 
GGQHLKGGKKKKFPPKRGE 


224 


14125 


A 


229 


314 


1 


LIKTIFLYQVjSLCCPG*SQTPGIjKQSSC 
LSLPKCWDHRHKPPCPAQSFYFTFTERA 
LRVI Y*FI *RQS /LRSVAQAGVRWCGPG 
SLHPPPPRFKQLSCIiSIiLSSW 


225 


14126 


A 


230 


86 


405 


VSYMS IQDHENGHVSGYFELVGQFCLRV 
GIVGGSTPESSNSHSYIYFFSFYLFIFF 
EMGLSVTQAGVQWHKHGLLQPQPPG\LT 
*SSHLSFPRSWDHRPVSPWPA 


226 


14127 


A 


231 


386 


98 


FFPQRR*FPFGNFPPFFPPPKLG\PSQK 
SPIGDFPPPPFFNPGPGPPFFFPPPFGK 
GFSFPPPL*FGPPQGFF*RAPPFFFFFF 
FFFFFFFPIRLLV 


227 


14128 


A 


232 


2 


362 


TYVLFVIPYVETDLVQ*AG*SYSMDYSN 
LTRLQTLNLNVPFNIAALATLRIiLYLHE 
TGSirerPIRITCHSDEVTFHPYYTIKDAIi 
G*L\IjFIjLiSIiMTLTLiFSPDLLRDPDNYT 
IiANPLNTPG 


228 


14129 


A 


233 


5 


365 


KNVT P P VNS S QHDLTD I YRNL YLMATE Y 
TFFSN/AHGTYTKIDHNLSHKRNLNKF* 
RLKF * RLKLNS KGVLWPKWNQTTNQ * QK 
DNRKLSKHLKTKNTLLNNP* IKEEVSWE 
ILKMYLRHG 


229 


14130 


A 


234 


3 


367 


NRIVPNHDIPYFSIiCVCVLFFFDRVS\T 
IiSL*GWSAGAGSLLTAAPTSCVQAILCL 
SZjZjSTWNYRRMPLRPAYL/SYFKSSKSF 
TL/IiPKGIL 


230 


14131 


A 


235 


107 


2 


QEAYE*DKERSSRIYMELK/SIQPKPPN 
NRVKKWTKEINR/HFSKEDTQTDKRHMK 
KG*SIjICISLVISD*QRNAN*TPNEIRL 
IP 


231 


14132 


A 


236 


349 


0 


NNL * I FL I FLFTS VFLSL I S YLF Y Y I FI 
RSLP I CGLFT YFFS FS Y I FLKYLMTLAI 
YYSF*\FPSLIQVIIiFALIFI 


232 


14133 


A 


237 


3 


317 


TRMHISDKKTYNTNIKRIFRIHLMRQTT 
QliKIGKRYEQTL YKS I FEWT INTYKEVK 
QrjNSKRKPTNTVIKK*SKDIiYKHFSEED 
IQITTRY\IjKKC*ASLIIKEM 


233 


14134 


A 


238 


346 


1 


AAAQMPAYQELVEEAIAYGRKLGGSQED 
QTMNANDQRFVMSAADILS* IPDWSTE 
VHARLS FDKYAMSARA\RRLI ELSKEYG 
LSKGRIVI\KLSSTWEGI,QAAKEIjDEQR 
GIGC 


234 


14135 


A 


239 


1 


279 


LLLPPLPSIiLLPPSPSSCLPPSPSS/SP 
PSP/S/PPPSLLHPPPPSLLLPHLPSLL 
DPPPTSHLIiPSIP*LDIPNAPT\NSSNP 
SCPCRPRSTEYFK 


235 


14136 


A 


240 


98 


2 


RPRRPRLVLNS* XLKLSAHISLPKCWDY 
RREPP 


236 


14137 


A 


241 


342 


1 


DTASIiLQPMDQGAISTFWVYYL/RNIFH 
EAIAAIDSDSPDGTGHSKLKTF*K6FTI 
LDAIKNICDSW/EEGVKISTLTGVWKKL 
ILTLMDDFEGPKTIVGGVTADWETARE 
LEL 


237 


14138 


A 


242 


3 


339 


RKDSRKRERKNKEKRKGRKEGRKKPIRM 
TVHFSSKTMVAK/ARRQ*NPVFRMtiEE* 
KN\ FPPRILYSE I I FFGSESQ I KKFSGD 
GNPKGSFPELFFLRIPMEVFNLKEAYLR 
KS 


238 


14139 


A 


243 


216 


3 


NSTGTQ'rQSQKPMI»SLVKKr,ISPiyr,*D 
GVLLFAQDGVKWRDLGSLQPPPP/GPTL 
FKQPSCLSLPSS*GSRI 


239 


14140 


A 


244 


408 


2 


VLTS *SNIAST\WYGLYQ/TQLR/KIIA 
YSS I / THIG/WNNPVLQYNHNI TILNLT 
lYIILTLNA/FLVLNFNSSIGTILLSRT 
*NEIT*I,PSLIPSTIiLSLGGLPPLTGPL 
PKWAIIEEFTKINSIiIIPTIIATITLLN 
LY 


240 


14141 


A 


245 


3 


243 


I IMLKAGQMTVFPNEDVWMAYKHTDRYS 
TSLVMNEMXQIKTTMGYHYAP ITVAKLIi 
N/SNTRYWP*RG/CSRVIHCWRKCR 


241 


14142 


A 


246 


2 


337 


FLIMDLQKYINPKIKEPQGETNHKTVSF 
MNINAKI LNKI LVI QI * * PP KKNNVS KF 
G/FIP*/SQGCFNINQCYSIjDERRKYMI 
ISKNIEKAFNKIQHSSFMIKTKNRKKLP 
EL 


242 


14143 


A 


247 


347 


1 


HYTPIRMTKIQNTDNIKCW*GCGNP/GT 
LLHCW*ECKMVQA1,/WL*TVWQLFTKLN 
LLLPYNPITFFGIYPNEGRTHVHTKTCT 
LIF 1 1 AALFI I ARTQKQ / PRCS SVDE WI 
NKLWYIQ 


243 


14144 


A 


248 


319 


2 


KKIILFIICTERVFIjCCPGWSQTPRLKQ 
SSSLGLPKG*DYRRESS\PGLNVIL*CL 
KFHLRVAMhFYVFEV*\lVIFFFLRQSL. 
/ DS VTRAGVQ WYNLS SLQL P P PGFK 


244 


14145 


A 


249 


329 


3 


KFPFFHTGEAKNYAVFVINKRIKKKQHI 
HI lYNYGGHQKAECKEIEAHVHCWWEYK 
MMQ/SL*KTAW*FLKK*TMELPYDPVIL 
LliRIYSKELKAGTRTDVCTSMCSFLFL 


245 


14146 


A 


250 


316 


3 


KTHKT I MGTNH I PNE KID S T * LKlSILiISri R 
AITIKLL\KNNRVSLHDLRIGHGLLDMT 
PRAQAAKKKIKR* IDEFIETKNFCASKD 
NIKRV/NRQHKE * DTMFAtSfHI S 


246 


14147 


A 


251 


152 


364 


QLTLSmiINNQIKYK*TRHS \iraHHHHH 
HHHHHHHHQ* KRLPE * S YKSKSQEHYT Y 
GMDITGPKNLRTI PIi 


247 


14148 


A 


252 


230 


1 


PCWICEPrVSSNFCWPVLETGSCYVAHA 

RVQWCDNRSL*PQTPGLKRYSCLSFRSG 

WD *RH/ VP lANF * TLPRGGVL 


248 


14149 


A 


253 


3 


345 


DVGLAGXjELLiTSGDPPAIiASQIAGITGV 
SHHSWPLLFFFFFFF/CFFEGGFFFPLP 
V* SQGGDLGSREPWPYGFKGISRIiRFPN 
EGKIGPQPIAPDMFCFFDKTWLSTWPG 
WFH 


249 


14150 


A 


254 


194 


1 


GRVDTKWANTHERFSKCS/TTVD*WXFK 
S W YLCTME YYST INKKE ILS FVTTWMHL 
EDIMIiSEVS 


250 


14151 


A 


255 


327 


3 


VKTAEFVNKWQKNSTKLWNSQAQIDKKK 
IWQINDLRQTEIWMGDRIMNLESRIQM 
QCDWNTSDFCVTPQ\YNETEH*WKKVKR 
HLEGREENLTL* I VKLKEQDFEASQ 


251 


14152 


A 


256 


314 


3 


KNFPSLINFLRQSFAFNGQGGVQRVDLG 
S PQP LD SRVSDR I RL C FKTQKKKRFS P * 
QTILKLSKV/E/MTERVLKTAREKHLVT 
YK/GKPIRLTMDFSAETLQARRE 


252 


14153 


A 


257 


2 


308 


RW¥IWW\EEEEEEVDCGGEV* FFFFCFFF 
FFFCFYVFFFYIiFFFCFFFFIFPYFLFP 
LFLFIFFFFFPVPFIFFFFFDCFFFFFF 
FFFFFFFFFFFFFFFFVFF 


253 


14154 


A 


258 


52 


186 


CFGKIjGWDDCLRSGVQGCSET*ACHCTP 
AWAT\2SnSTLSriKKRRRG 


254 


14155 


A 


259 


1 


269 


LKRSSHLSLPRSW]SnfRCMPPCPANF*TF 
PVETGSHHIAQAHLKL\PPASA/FPKCW 
DYRHESIiCPALDS PFTNINSATIIT I LQ 
GKCWHYC 


255 


14156 


A 


260 


387 


3 


PLLQVAKINPKRILDIiNVKPKTIKCLQE 
NTGENCWDFGSGKDFLDMTPKMQSTK*Q 
ISKL IKr YNFSS KTQ/S FCTTDHENFLE 
DIVKRIKIQSHKLEKRFVWHIPDKRLIS 
RIYQELFRTQHEKPHTIK 


256 


14157 


A 


261 


62 


72 


LRI IKFSEESMLKAEMG*KLGIjFHQAVN 
KCNSIiTVNEWNAKKRFLKKMKSATPVK 
THMIRK\DMBKSrhE\mXDQZSHNlShSQ 
SL I QRKVXjILFS SMKPARGEEAV* * KNY 


257 


14158 


A 


262 


2 


193 


GGRGCSEtjRS CHCr PAWVTRAKLSLKKK 
KKKNSKF/RELGNKGQFMGP*LRKGFTC 
YKKRSPLilF 


258 


14159 


A 


263 


3 


330 


QLKKKY YEEL YAJSTKIiGNPDEMDKFLE I C 
ILQKLTQHKN^ KFFKDSRRKS KMNRP I T 
NRIilQ^PKELPKRKTLRPDGFTGELYHK 
HFFFKIiTKPLHKFF/AKIKEEEPFPNS 


259 


14160 


A 


264 


326 


43 


TQETEAGESIi\DPGGRGCN*LGSCPCTP 
AWATGMKLPFKKKKKKITSLPELRFLSY 
TCS S S QNFLYLNDDT I IHPVAQAKNLGD 
hDSSPTDPIQ 


260 


14161 


A 


265 


311 


1 


FPP FP / S FPLERKEKFKGRKKRGGTKRG 
PFPPFFPFSSLPLGFP/SPQRA*FLLFR 
FREFLKFFFPFYLPPKPGLGKFFFFFPF 
FWDGVSLCRPGWSAVARSRLTV 


261 


14162 


A 


266 


5 


323 


DSTKAQKNTWS lEPGEVCTLIHC/ WVQ 
PL* KT VW W FLK / D VK/ LE LR YD P AI AHL 
Gl/YPKKNKT*TCTQVNrTALFIMTKM* 
K*PKD/PINNEWVT/KLWYMHTMEY*ST 
IN 


262 


14163 


A 


267 


3 


293 


GSLQPIiAPRPK* FS CLSLL/ S AGEWHEP 
WRRSLQRSCHCTPSSLGYRAGLHLKKNN 
NNNNKSEVSRICTIGIFLLFIY/CIYET 
ESHSVT*AGVHWLDF 


263 


14164 


A 


268 


3 


332 


hls i inlnvnqiins plkaytlp vwi * kk 
kwpny\ cclqe ihfaskntyklkvkelk 
kkfqanekekhadktgpksktvkXkkng 

HYJMIKRSVQKBNlYIYMFIilADPDXC 


264 


14165 


A 


269 


3 


326 


KVLERHDVLKLTQ/ DKI CNLNM / P YTKE 
MES I VNITLPKKKTPGLHSFTGEF YQTFK 
NEMILTSYNFSQKFEAEEIIjNSFYEAS I 
IjIi\IPKLDKDIIRKEN*R/PNAKILNKI 
L 


265 


14166 


A 


270 


1 


318 


PYPAKLSSLSKGBIGTP\LDKQMLKEPV 
TTRPALQEILKGALITVLI * /LERKDHYQ 
QMQKH 


266 


14167 


A 


271 


1 


318 


LKLLTSGNLPASASQTAGITGINHRAQP 
HLSFSL*DWNET/CVPG 


267 


14168 


A 


272 


108 


322 


DSHVl FliLCGNLTVFVCLFLRWGIj/DS I 
THPGVQWHHHNSL*P*TPGLK*SYHLSL 
SSI WNyRRTPIiRIiAFFL 


268 


14169 


A 


273 


341 


3 


GFFFPFPFFRQGFPLLAQVK/VPWGNFR 
SLQPLPPGVKQFSCLNLPKN/WGFSLKG 
FFKKGCHFPF*Y* IFFNGVKKPDIKFSI 

FTNPFFFFFETESCSWQAGVQ*HDLGS 

TRP 


269 


14170 


A 


274 


41 


328 


FWLLTPGQETERERESETERESEREREA 
GEAERGRETDGIDSLIYSKLYSKTIHLK 
XhEYI XATYFKRIiFYNWEIi* T*AK/TRI 
lEKPSVRHQCQQRS 


270 


14171 


A 


275 


3 


322 


EAQEIiL\DPGSGGCSBLRSHHCTSAWAT 
KAKLRFKKKKKKKCPGRYLGNIVIiQ* FN 
I lAKCDKYHI VKL I F/CFSSAYTAS I QK 
LIQVSRTVTHRKNKKTRQYVYILD 


271 


14172 


A 


276 


175 


313 


STLISYFRDRVLI.CYPGWSAVGQSQFTA 
A*NFW\VK*SSSLSS\PSSW 


272 


14173 


A 


277 


254 


3 


RPRRQFGIEGSFLNQIKNIYKKSTANVI 
liYVDRLNAFSLISGTRQGCPLSRLPFNI 
VL*LPVWIIRQ/EKVr*GMQIVKEELWIi 
SL 


273 


14174 


A 


278 


186 


1 


PETMQARRQ*SEIFKVLKEKN/LQHRIL 
YPEKLSFQSEREIKTF\QRLKKFITSRP 
ALQRMLKE 


274 


14175 


A 


279 


288 


3 


GRLRGKKGFNPEGENSKEFFKPLPSGLG 
AKTQ PVFKKKKKKKKTRNP I KK/ WAKDP 
NRYLTKEAI *]yLASRHMKRCSTSCVIREL 
*IKTTMRYDYVLI 


275 


14176 


A 


280 


3 


329 


LKESSRLSLLCSWDHRFALPPMLSGLVW 
NSCPQ / CDPSAS AS LS IG I TGMSHHTWL 
* WLF / C* ETGSHPVTQAGMQWYDPSPLH 
PTKKGVYIiT/RRFWRLGSPSAW 


276 


14177 


A 


281 


233 


3 


1XTKDKEEHFIMIKR3IRQQGXT1TNIC 
VP YNRGS K* MK* KLTKLMGENR * /HS 1 1 
lAGDFNTTFSIMDRAIRKSAKNG 


277 


14178 


A 


282 


32 


309 


LPDITPRDHLSPGTMDFIQETGCSKCWR 
GCGTPVCC*WECKL/VIVQSLSRTVWRF 
LBCKLKIGQVRWLMPVr SALWDVEVGGS P 
EARSSRPAWP 


278 


14179 


A 


283 


1 


302 


CKRinsrKAWMTVHLPTAWLSE YFT PTVET 
YCS *KTIPFTIL/LLDDNAPGQPGVLVE 
MHKDMNVAFRPANTASILQPMNQGGIST 
FSiTSYYLRNTFPKAIVAIE 


279 


14180 


A 


284 


155 


1 


PRRFFFFFF*DRVLLCHPG*STAT*SCL 
TTKSASQV\RQFSCLEPPSSWDY 


280 


14181 


A 


285 


131 


15 


DRSNPGRFLWTSNSSLYERPREIRPSSQ 
APPPVNDP I * T 


281 


14182 


A 


286 


1 


359 


FFFFFFSWFWCFSFFFFKKKGGAPGGG 
GPPPPKKTPLFSPQKKNFFLQKKPPGKG 
FKGPGLPPPNFGPG/RNGGPLKPGDPHP 
PDFTPPRWLFKPPF * KRGEPFFPPPGFP 
KNUjFLKKVP 


282 


14183 


A 


287


52 


419 


LEERAGTSWPKKENKKQHFCMKKKKKKK 
KKKKKKKKKKKKKFFKKGAQ I FS F * GGV 
NL/H/ SGPGPVLKTGGGGRHSPPPPPSP 
PSSLFFPSSSLPSLPFLSLPLPSFLPSS 
PPLLSS/CPPPLFSS 


283 


14184


A 


288 


236 


325 


liNIRTPGSWLXAVAHACNPSTLGGRGRR 
IT 


284 


14185 


A 


289 


218 


409 


KVKGNKADIiIRSANGS*N/VKGVIiCHQY 
LEKSFCLETEPHSCLPQLVHWPDLGLLQ 
PMPPGIARF 


285 


14186 


A 


290 


3 


426 


HEETGSPRIiECSGMITAHCSLDLPKLK* 
SSHLSLGS/RHVPPCLAHFLY 


286 


14187 


A 


291 


382 


1 


ANFYVTLVQXQGFTMLRSWRPA* PCDPP 
TWASQSAGITGVSHHAWPKMSTITLGVY 
SFGESEVPSIPKPYFLBTGSSFLPQAGA 
* *CNHSSLYP*TPGI*KQSFHIiSI»LiSTSV 
YRYLPPCPIiCPVLV 


287 


14188 


A 


292


3 


309 


HEVPYFTUSTQQLEMIKIiSKGGIiVKAKMG 
MSQIVNAKEKFLKEIKSVIPVNTRMIK 
QNSLIA\ETQEVSLVWIEDQTSHNrPLS 
*YLIQSKFLTLFNSPKADM 


288 


14189 


A 


293 


2 


415 


ARDQYKKSTKK^AKDLYRHFKWEAr/HM 
ASNYK/ KRCS IS* VI RKIE 1 KS SMTS C Y 
THLQN/ALKIiKSDNFKC*QKCTAMGIIIi 
SWWECNLVQSLWTW\QNLLMLWTHKPy* 
SAX/PTSGLYPTEMYRNIEQASLRMFWA 
WRGG 


289 


14190 


A 


294 


335 


108 


IPTPK*PLSSPPIHTPYTTIPKP/RPPP 
P*HPPHIHPPPPPLTP\PLPTHTLPTPT 
PTNLPHIPPLYSIPPSSPKIS 


290 


14191 


A 


295 


266 


2 


GFPKKVFSFPIGPPKLGFPNSVFS/LFT 
P FNPGIiCLKFFQF F P FSAFLPLGPS FP F 
FFFFFFFF* DRAVL CHPGWSAWRS WLT 
ATLV 


291 


14192 


A 


296 


347 


1 


DSLQP*TRGLN*FFHLSFLTRWEYRCAP 
QCPAISrFC/VFL*RWGFAMLPKLVSSDLP 
AIiASQSAGITGVSHCAWLVFLLFSLYFI 
HFSKASETPWAQNQSQTPSEYPATYNFK 

lYSC 


292 


14193 


A 


297 


284 


2 


CRQGFVLCRLVSNSCDQVLCVRLCV*VL 
GLQACAT * LGAEGVS ISFSCFHKGRGCL 
CRHRSS /HSMFSAWC*PHTHTHTHTHTH 
THTPSPMQIiV 


293 


14194 


A 


298 


1 


96 


GTRLCLGAITTLFAAVCALTQKDIiTKIV 
DFST*SQLGLILVTIGINQPHLSFLHIC 
THALPKAILSMCYITI IHILNNDQDSQT 

IRRLLNTMHrNSTS\LTIDSLS*L/GKP 

FLN/GYY * QKDLTKI VDFST 


294 


14195 


A 


299 


3 


134 


HEGRD*PGNHGDTLSLLKIQKIiAGSGGV 
CL\KLRKLMCENHIjNP 


295 


14196 


A 


300 


2 


333 


GVGREGGSREGGEEESGWNG6IRERRSG 
KREEEGKEYRDEVDRKERGEEGGRERER 
GRGWEBEQKGAS * C/ EEGRVK 


296 


14197 


A 


301 


354 


1


TENELIiSRIYKNFYQIN/RYKQKNSIKK 
*RK*QAI/TTETIQTV1IKHLKKCSISDI 
IRELQTKTR* ILYT*IiVEKKLKTNGI YC 
WECCRELGSC 


297 


14198 


A 


302 


1 


282 


GTRGMVAGAYNLGLPSSWDYR/^CLPPC 
IiANFFVSIiVERGFTRLTTLVLLS * LYDS 
PASRSETAGI PlilHTSCI VYWSLRWNAD 
lAILCKAQAV 


298 


14199 


A 


303 


361 


3 


iiPKWWDYRHEPLCLATFFFF*AFPPVPL 
CMYVPPW/RYVAFFFSLSSF* ICCTFTIi 
YIFPQLMDG*AVSSF]:iFLQPVGVKFKKK 
FFLSFFK\KRRSLGMIiPKLILWSWAQVI 
HPPWPRPRA 


299 


14200 


A 


304 


2 


352 


ARMVSVS*PCDPPA*ASQSAEITGMSHH 
AQPIIIS*TIAYVSFSL>SSLGT*ISSVR 
SFTKSMSQLLYFYYFVSL/PSFW 


300 


14201 


A 


305 


334 


1 


KNSFFFFFKRSLGIiliAQI* / VQWGDFKT 
LQPLPPGVKQISRLNLLKKWDY*RGPSG 
LGKFW/ IFIi*KQGFPQLFQVVIiNFFFF/ 
CPPETESRSVAQAGVQWHDLCSIiQAPPP 
SSC 


301 


14202 


A 


306 


270 


2 


DWCAGGGGDGVPRRQVIFVILVETGFWR 
VGQAGFQriLiASSYIiPALA/FPKCLDFRH 
* PPHSALKALFFF*DGVSLLLPRLECNG 
ASSPRA 


302 


14203 


A 


307 


306 


2 


HIFQCVCVCVCVCCWCVCWWCVCVCV 
KI.VISQT*IiSPLCSGP*A/CTPFSVCVC 
VCVCCWCVXCVCVCVCVCKVSDLTNLS 
TLCSGP*A*HFFSVCVCSC 


303 


14204 


A 


308 


117 


332 


SPPVLLRCSLSLSVQLNGKTIRFLK*LK 
MEL*FHSAIPLLGIYPKGKKFIiYQKDT/ 
CHSIFITALFIIAK 


304 


14205 


A 


309 


1


351 


GTRKTNN*KWAKDLSRHFSKEDIRNGQ* 
V*/HMKRCSASLIIRIM*ISITV\RYNL 
TCIIM 


305 


14206 


A 


310 


1 


352 


PSPPNSPPSPLPSRPPQPPPYPPPPPRP 
PRSPL*YTPPAPPYW/PSPIIiTPSP/PS 
DPPEPRSSRP 


306 


14207 


A 


311 


1 


405 


FKPSP * PIjTGALLGLLMTSGIAM* FHIjH 
S I T\liL IL.C \ LLTNT \ LTI YQ * WR WTR 
ERTYQGHHTPPDQKGLRYGI ILF I TSEV 
FFFAGYF^AFYHSSLAPTPQLGGHWPPT 
GMTPLNPLEVLLLNTSVILAS*VS I 


307 


14208 


A 


312 


3 


176 


HE I LGSNF/ CGETQVS IHCPG*S *TREL 
KQSSHLHI.PKCWHLRA*TTVPGI.TTIFH 
LGKPKHKRVR 


308 


14209 


A 


313 


298 


327 


SLTLSPRL*KIKK*RKK*NKCWQGCKEI 

GAFTHCWWGCKMVQPLWKS\W*LPKKVK 

*KLSYDPAIPLV 


309 


14210 


A 


314 


344 


3 


HSTSLVIREM*IKTRMTYPFTTTKMAII 
*KNRS*\WI,GCGKTGSLIHSW*KC*MVQ 
SLWNTFGQFF I KL 1 * ELT I PLLDL YLRE 
MKTYDHTKTCI *MFI VALFI lAQNVKMS 
LV 


310 


14211 


A 


315 


18 


321 


WALPVYCLJSKKEFGQPPRTACFFFWDRV 
SLCLPGWRAVMGSWVT/VRPKLLAFN*S 
SHLSLPSSWDYRLIPPCSAKDSICEKNR 
VSTVDGADKKVRRGKEGD 


311 


14212 


A 


316 


289 


52 


ETSCDVPSKGIIiHFGKRNTHIMPEKWAK 
GMR*YFH\KEDK*MAHIHVKRC*TSLVI 
MKMQIKSSTPYHFIPPRLTKPLKSAMTK 
CW* TCI WAI YLSSSMKILSHALCPFFWH 
DVCISLTEVKNAFTGHITRRFS 


312 


14213 


A 


317 


57 


324 


YC VTFFFFGKQS F VLS PRIiNCMGP F WVW 
CSLiSLLSTWNYGLVPGTQPFFXEFLVQR 
GFHYVAQVGLKLLTSGDLLP*GSKGAGF
TGLTHC


313 


14214 


A 


318 


65 


341 


QWLLKSRSCYFFFL*DRSWAVAQA*VQW 
R/NPPPPGFKQF/S CVSLLSSWDYRCLP 
PHLADFWIFSRDQPDQHSGTSSLLKBCKK 
KREIPNLPGLG 


314 


14215 


A 


319 


2 


182 


ARGDYRHMPPYT\H*LiETGSCFVIQAGL 
ELjLDSSHPPALTSQCWDCTCEPQSWSAP 
CVSl 


315 


14216 


A 


320 


345 


3 


mpprqahfxvpfiemgfhhvaraglell 
GS inpptsasqsswdhr*sqsawi tgvs 
hhaqlgtitsyhillflkkgrahac*sq 
hdpttclhvnlccslllaisrpptttnii 

PRA 


316 


14217 


A 


321 


122 


466 


QSFSTYC*KNWVTI /CFLKS IDS YLVPY 
TKIDSKWI IDLNVKPKTIKLLGGNMGEN 
LCHLMLSKDSSDVTQKA* S lEKQ INKFN 
SHGGTRLRSQPLKRAEVGRVRLSPGGRS 
YSDL 


317 


14218 


A 


322 


3 


345 


HEDAVSAPCNLHLPGSSNCPASASRVAE 
MTVPA\PCPANFFFFFFIjEREFGFCCPG 
/CK*K*VFQKKPKGKKKGRIEGKKGVWG 
PKLKERERKKRKKERRKRGKERKEVRTE 
MMGG 


318 


14219 


A 


323 


206 


446 


GNLHQCPYKWTHLAKYMCEKQDSIG* IL 
KECYQKPLIiEKSHLIAQVENDEKPADW/ 
PIiLiRGEYVEWND 


319 


14220 


A 


324 


62 


327 


VERLLRDLRENFCRNPDGSEAPWCFTLR 
PGMRAAFCYQIRRCTDDVRPQGEAQAWG 
LQSRAGSLEPEGRGEVSA*W/SARTRHR 


320 


14221 


A 


325 


441 


1 


RKKK* ILQGL/EFRLEHWAS PHRDHRV 
WPRQEGKLFSEGKNKATRMTVLYPEE/S 
SKKXGSRS*GSE/CTSVFRITLSVGYVA 
YTGAP VS I S ER P S AAI YHRMNKW * HQHR 
RHLLGFSEQEPVQGVEPLQGSSQGPKEP 
QPSPSNPAPRA 


321 


14222 


A 


326 


2 


358 


ARARTLRIMVNLNYAKSTLPVLCKGKNK 
VWTAHLFTASFAEYFKPAVETYCLEKX 
FFKILQLIGDAPCHPRPLMDMHKEVNVF 
VPAKTS IQQPMD \ IS PFKS YY* EITFQA 
GCCGSCL 


322 


14223 


A 


327 


1 


430 


ARENMPGHLHRCIIEQDWRIHMLIPKIjN 
TQMIKIV*YWYKDRYVDTE/YNNIESTE 
VNLYI CGEL I FSRSAKI IQWWGKRI VLN 
KWCWDD/WIsfL/SCKSM*MDSyLTI,Cr\K 
ITPWIINPTGRAKTMRLLP/EKTGVSPC 
DFG 


323 


14224 


A 


328 


146 


374 


KCLMLTKPYNIMRLT* PYEIR* / CGDTF 
P * S YYA*T6TAVRTWGLTPVIPALWEAE 
AGGSRGQEMETILANTEKPRL 


324 


14225 


A 


329 


2 


322 


arasrtf i vrktqclaskdkltlvrg* c 
scdfqlksmlidysenpratik/nmlykr 
istnkawmtlglftawsteXkptvetycsq 

*\KlTIiKlWh 


325 


14226 


A 


330 


350 


47 


emgflhagqaglelltsgdppasafqsa 

GTTGVSHRARPAN* KKKF/CLKT/RVFL 
CCPGWS*TPGLK*SSCFGLSSHWNYRHB 
PPRPAQFIiIiIDFYIiRHYRES 


326 


14227 


A 


331 


753 


458 


FFFVE*GVL\HVAQAGLKLLTSRDLPTS 
AS * VAETTSTHHHA* LHLFNFEKYFCKE 
QILLCCPGWSRTPALKRYSHIiCLPNSWD 
YRHEHCNEPEEISIil 


327 


14228 


A 


332 


256 


3 


TLVLMQSCL.SLLSSWDYRHKPP/ *LSNI 
FNLPMGLSGHNPI INGGISVSLDRWVIi 
FLFC/LCFETGSHSVAQAGVQWHDHGSIi 
HSC 


328 


14229 


A 


333 


25 


358 


TPDIjR* STCLSLPKCWDYKHVPPHPANI 
FI *QKLFFISDGQYANVLTWFEGGTFHT 
*A*RP1IHHSYKLRKDQTIIFIYLFP/I,R 
WSL/NSVAQAGVQ/WHNLGSLQSLPPEF 
K 


329 


14230 


A 


334 


3 


329 


HEDWSPFWPGWSQTPDLR*SACLGLPK 
CWDYRREPPHIiALNAFIjT *NI FNL* W/ C 
hSGSNPXlRSlCIRFLGKGG*LIjFIFIF 
VWFLTQSIiTHSVTQAAMQWHDLGSLQ 


330 


14231 


A 


335 


26 


330 


SQLLGRPRQENHLNTGGEGCSL/CKI IL 
VCFWLGHIiLPVWFRC / ILYLF / C WSLG 
FNFCFSLVL/CWFLLCFFCAFMFCLFY* 
WFVI CHFLFFFFLFYFFVMFLFF 


331 


14232 


A 


336 


3 


348 


DEVFKS/YH*HPMLKC*GDYSLLFHGES 
DLDTTQILTHPSTTAMYFVHYCQSP^IL 
YGTIDT*PPVIHRlvrPIHIRTPYPCLQAS 
TAINLQIiSHMNCNSKATPHSLGYHQTYP 
PLTV 


332 


14233 


A 


337 


2 


329 


ARAARAARVTGIAWAYHLIGKGLEPFFF 
iyiALLFLR*-GL/NSVTLARVHWYNHGSM* 
A*PPGLRCMSLCPA/NFLYFIiVE/lGFA 
TLiPRLQAICIiGSSDLPTSTSGSAEITGV 
2 


333


14234 


A 


338 


60 


354 


NGKECKKL I YLS I YLS I YLS I YLS I YHL 
C*YL/SCLSI*LSrT*IiAVYYLSVCLSVS 
VCL\VASC1j* is I YLS IHLSFYS IDRIL 
AHHCVP PTCS liPHATH 


334 


14235 


A 


339 


3 


351 


RD ADVMLLQ I LVI FQAIIALGVQTEG^aAG 
VTHHDRVLVNWVRFPQCIi/HRV* SNGT\ 
P^VGTSASERPGLKQSSHPSLPSSWDYR 
CEPPCPVNF/CFFV 


335 


14236 


A 


340 


190 


358 


FGSWLVFFF/ CFLETDSRFAP *AEMQGP 
1TFG*LNPPPPGLRGFFGLSLPGTGDYGP 
V 


336 


14237 


A 


341 


306 


119 


LRWENHLSLGN*GCSEP*FRHCTPTWTT

E*DSVSKQKQKQNNLELKKPGSS\CWF 

LPETLFT 


337 


14238 


A 


342 


1 


122 


GTRGCGEPRSRHCTIiA\W*QSKTPSKKK 
KKKKKKKTGPFFF 


338 


14239 


A 


343 


92 


316 


VCWDlPIiRKSRTlClLFYPT.VlFLFETK 
ICSSFSF* KGCM/WLGAVAHACNPSTLG 
GQSGWIT*GHE/FKTSLA 


339 


14240 


A 


344 


83 


329 


ATAPAPCYFLIIjCI CMCVCVCMHTSVCT 
/CVILCMCMGLCHIL* TCVA/CTCLGMR 

FLLPAHI IICVKRFCLMCGSGNSGSLCL 


340 


14241 


A 


345 


127 


357 


ygvlsrgrvlipcwpg*srtsgf/sspea 
nswppglanilylvetgfchvaqaglel 
ldsshpptsasqsaeitsmn/hra 


341 


14242 


A 


346 


312 


1 


LMWLNRS VGALLTRGVKCHKAFRST I S L 
LIGALK* *QSKSDLGISLSKWKDLNND 
IYRSLLRQSEEDTS\KWKDIPCS*TGTP 
NI VKMS IIiTKAI YRFNASC 


342 


14243 


A 


347 


350 


243 


FHHVGQDGIjNLLTL*ST/PCLGLPKCWD 
YRCEPPRLAY 


343 


14244 


A 


348 


165 


338 


HTLDPASHEGPTFLFSL/PVEQCLK*LT 
LKVAT I VliFCF CF * DRVTLCHPGWS AVM 

QSQ 


344 


14245 


A 


349 


328 


2 


SVCPHGSINLSPAETTGACH/RRIiA*NI 
DSRV* AKTFKGK* TNCSYNNlSnSlNNNN]^ 
WNNKPFLLIjRIiAGDRHPDQIiCPPPRRCH 
FNGATPAGPPSCLRRSHHSSAVCSC 


345 


14246 


A 


350 


3 


338 


HEIEELITPHDRALIGNILICSLVLYAIi 
FLTLTTKLTNTNILHAKEIDTVRTILPA 
IILILIA\LPSLRILYITDEGNDPSLTI 
KS IGHQ W Y* TYE YTD YGGL I FNS YX LP P 


346 


14247 


A 


351 


124 


382 


NTFKS *KrSNKTKNAI RT* AKDTKQHFTG 
EDIQIAITKHMKRCLIPLGKRXSLRQEIS 
PRVRINKIRPGMVAHTCNPNTLGGWGRQ 
IT 


347 


14248 


A 


352 


3 


368 


RDRASLFCPGWS*HPELK*SSCLGIiPKC 
WDYRR/AATAPGLLCL 


348 


14249 


A 


353 


390 


161 


FSRDRISPRWSGWSP/DLRLSP/CLSLP 
KCWDYR / R* ATMPGTFFHFNLKNQDARN 
Q 1 1 TCPNLTNKY I YMTS SF I LVHS 


349 


14250 


A 


354 


123 


345


IiAPQHFASLRLARP / PAGPPCDRPLAPP 
RPS*FSPAPLiPPGADRSVPLSPSVPP*S 
LPPPAGAPPR/PPPPRDSPRR 


350 


14251 


A 


355 


37 


393 


TKHFVSTCYVPE F I AGTNQRKRNRI SVL 
ANVKDKQI IL.LRVRTAITEGYRNYHVLF 
LVF VCFFS P P KQGL / DSAFQAGVWWHNH 
NSL*P*TPGLKLSSHLDLQSeWDYRRVP 
/RMSSYF 


351 


14252 


A 


356 


2 


335 


ARETS KDRLTLLLGAMAGGDF \KLKPVL 
LYHSENPRPLKNYAWST/LPVL^KWNKA 
*MT\HVFTEWLiAEYFKPTVQTYCIiEN\K 
FS ILLINNPLSHPRALMDRFKETIGVLM 
PA 


352


14253 


A 


357 


199 


377 


VGSSCFLKVCQ/C*AVCVCVCVCVCVCV 
CVCLRCV/ CVCL/ CV\ CVGFFFFLVLCG 
CGCVCVC 


353 


14254 


A 


358 


1


335 


GTRTTCMYHVPPRSANF*LVLERWGIiPM 
LPRLVANSPPQTILLLQPPTVLPNPERT 
SALY/YRPLLDLSPTSLSPRIPSSPLTS 
SLSlPAPPRSHLiAPPPPRPPPHPPIRPS 
A 


354 


14255 


A 


359 


3 


369 


KPSP*PIiTGALSALLMTSGLA\M*FGFH 
SITLrjILGIiIjTNTIiTIYQ*WRDVTREST 
YQGHHTPPVQKGLRYGI ILFITSEVPFF 
AGFF*AFYHSSIiAPTPQL*GHLPLIVII 
IiIiNSL*FPTLY 


355 


14256 


A 


361 


337 


121 


LWSQLLGRIjRQBDCLSPRSQGQNKP*CH 
CTPAmriE *DSVSKKGKWYIHNKKS SL 

KKK\NSC]MMQQFGGT 


356 


14257 


A 


362 


256 


375 


EKKTIVQYPHTYGINV* KNLPAKKTPGP 
DGFIDKLYNTFRGEITTSPHILFH*PKE 
VAVLIiNSFSK\AASlTLTPKPNICDIMRK 
ENYSPISSYIWNQCSFFLEQIFPLSFKL 
NCPGLIIPH 


357 


14258 


A 


363 


170 


1 


S QLLRRLRS QGACHHAWL I F VPS VDTGS 
HHIGQAGLKLLS \ ^AIRPPCPPKILARA 


358 


14259 


A 


364 


330 


3 


RIMSASASQSAVISDMSHDVRLIDI*HE 
P*FYQAIjYSFRQ* ITSSPSYR^SYLLSY 
KWQDEARe/AFVTAWQAEAGGMLEPSRIi 
VIiDS * PRDLPTS ASQSAGITDMSHHS C 


359 


14260 


A 


365 


1 


341 


SQYGLDFHDLSDLPTMASQSAGISGVSH 
RAWPASG*FYVSREVlCSYRI.*GyTPSL 
LVLESGSSSVTQAGAQPraNHSlLYPHTH 
GLK*SSCIiSITTR/WDYRHESP 


360 


14261 


A 


366 


2 


338 


ARMVS IS * PRDLPTMASQSAGITGVSHR 
AWPATGKFYVSRGVI CPI I * KDTHLFF / 
CFLETGSSSVTQAGAQWHNHSLL*PQTP 
GLK*SSCLS IPKC/WDYRHESP 


361 


14262 


A 


367 


2 


337 


ARAGILPCH*QRKCKS IRXPLWNNVGYY I 
LKmiGITHDPAIPFLVLYPSEMHTHAY 
*KrSMRMCITTLFIMAIN*KFKCS*SEE 
E/IGKFWYFT* *LIiYSNENEHIjQLHTTW 
XN 


362 


14263 


A 


368 


59 


348 


SAPGIQENEMPAGRGGSHL* sqhfgrpr 
RVDHLRS6VPD* LGDR/GBTPSKKKKKK 


363 


14264 


A 


369 


350 


3 


GWSQIPDIjRWSTHLSIjPKC*DYRRGPHA 
QL*TLMYKYLFEHLLSVLLGICPEVEWC 
/DHMAILCLISLGTViriSSKALHHYTFS 

PVTCKCSNFSISLSTLVTFHCFLNACYF 

PLFSRA 


364 


14265 


A 


370 


908 


30 


ECSGTILAHGKLPLPGSCHSPASAS*VA 
GTTGVCHRARLIFLYFLVETGFHRGLNL 
LTS*ST/SASASQSAEITSVGHHAQL/C 
LELELK*STCLNLPKCWDYPA*ATVPSQ 
FGSVFLRIF*PMFISNXGQLFSCSAFVW 
P * HQGNAGliIE* VRKYFLIiFKFLKEFA\ 
RTGINFR* MS GRTDW * HHP VRG I S LLKG 
F*LVX*CSY*L*VYPYFLLF*EVSLDSP 
RLECNGVHSESAGITGGSHHAQLVYPYF 
FF VCDS VliVGFVFLE X CHFI WLFNIiDVY 
LCLWYYLIITIilAVKSQ 


365 


14266 


A 


371 


164 


1 


S CRQVES YGX KRITCI SLAS SWWYRHAP 
PRPANF\ * FPAEMG \ FLHVDQAS LDSC 


366 


14267 


A 


372 


3 


383 


MVS TPAEDAVNI VDMTTN/ D * E YS LNLV 
DKAVAGLERIDSNFE X SSTVGKMLPNS I 
X CYKE I FHERKNQLMQQKS LLP YFNKLP 
QSIQHSAITTLISRQLSTIWQS/PPPTK 
/RFQLTEGSDD 


367 


14268 


A 


373 


1 


349 


TLLGNDHIYNVIVTAHAFWIFWIVIPI 
I XGGSGN*LI PLI XGAPVMAFARINKLR 
LRLLPTS \ I LLL\liAYAI LEAGARTG* T 
V* PALTRNY*NPEAYGHLSMFSIiHLTGV 
SSISR 


368 


14269 


A 


374 


443 


2 


SSGSRSECRFLPLI.F3WVL.EII»AGAIRQ 
/EKE/ 1 KGIH IGKVKVS IKLTE VI WKFI 
KVA*YKIN\QKSWCLYTNNEQLEKKFK 
XPFKIASKRIKyLEIWH*GKDIiYNENI,K 
TIiLKEMKENKWINIPCSWXRRLNNVNIS 
ILSKVTYKF 


369 


14270 


A 


375 


3 


201 


LWIKBa:iDIIPIIPS*IiFFNTELDKLinK 
LIWKFRGPR/ITQNSFEKPIVGGLILSD 
FKTYYKATVLK 


370 


14271 


A 


376 


11 


324 


DKFLEHPPLLVLQAVAGKGSPLHCLSTL 
PTSLF I * DRVLLCS PAWS AVAQTWFTAT 
ASWAQ*SSNHSPSD*R/PHTTTGHFFCR 
DKVIi/TMLPRLVSNSWTQAI 


371 


14272 


A 


377 


85 


364 


YQHNYSFCFISQYVYRLFFFF*TFFKKK 
GFWF *RLWFQGGNMG/SMEP * PP * IRQF 
FCLNPLN/ SPDYRGPPRGANF* I PLKKK 
GFQVFPSLSKIP 


372 


14273 


A 


378 


1 


382 


gtsgtsystifagtiiitalssh*pft*v 
gle3:nmlafipvlhqkyt\p*ythaaim 
hfltdsga/svllie illlysnslge* s 

S I lEIiQLIS IFLSMMLVIPNKFGAAYYS 
IHALCLDRPQLLHILLI 


373 


14274 


A 


379 


24 


323 


ipglkrssrlgllkcwdyr*dnfqhmcl 
iastfnnwaayyfvcki ccavdphgcs ikv 

S FFLS F / CFFFTKSRTVPQAE VQGGDLG 
*]:iEPIiPPGLMPFSGLSL. 


374 


14275 


A 


380 


2 


317 


AGWFQTPDLR*SPCLSLPKC*DFRG*PP 
CQKLFLCPIKIF*TRIiK/CYLNTL*QSL 
PLMHFKKNVIYFILYKAALFPFFLRRSL 
/HSVAQAGVQWHDLG/S/LQTPSPGFK 


375 


14276 


A 


381 


1 


323 


VKRQPTEWEKIFA/TIiYPSGKGlilTRlY 
KGLKQLEGKN/KSNNL ILKWAKED I QTA 
NRYMKRCSTSLI IRE 1 \MQIKTTTRYHL 
TPVKMAFIKR*/GNNEC**GYSEKRTLI 


376 


14277 


A 


382 


2 


248 


TQPL I LRCLPPRS I YRFNS I P I KI QVNF 
F\*EKEKSLLKFIQNLKGP*ITKTILRK 
KKVDGKTFPDPTMYYKATVIKTVSWYQ 


377 


14278 


A 


383 


178 


1 


HNPPLAISFCFFPFFFFLRQGIi/NSFAQ 
AI VKWYDHDLLQP* PLGLKRFSHIiSriLS 

SWDH 


378 


14279 


A 


384 


312 


26 


FLRGVFFFFFLRRSLi/DSVAQAGVQWPS 
FGSLQ APP PG FMP FS CFS LL / S PFFFFL 
VE /MGPTMLARKVS I S * P FGP PAS ASMP 
VEITGMSHCLANMFF 


379 


14280 


A 


385 


1 


849 


FFFFKQTKFI KLSKYKNI XKKS / SAFL Y 
I SNYLKMKFKKI P S T * L * FEVNLTKKLK 
HLTFYSKEHYTN*VTHKWNNITHS *TGI 
FNS * I FVLHKMI CRYNATS I KI P VT YF 1 
DI F/EKAYLKFI WYHKTP * lAKAlKTKE 
GI /LPDFE IHYKT WTKTVWHLNKNRDl 
GQWSRRKREQKYISVFTAN*F*IQVTFF 
FKGNNS I FNK* CLENFiXISTCR\ KKK* DP 
HLTPYVKINSK* ISHLNVRPKTLKLL\H 
QKIE*KPHNIGLGSKFFDIiT* ISQDTKG 
RTSQSDHF 


380 


14281 


A 


386 


3 


318 


LREMQNALESLSNRTEQVBBTTSBLKDK 
AFE*TQLNKDK\KKRI *KNB*RLQEVWD 
CVK^PDLRIIGVPEEEGKSKYLENMFEE 
I lEQNFPGIiARDLDIQIQVAQR 


381 


14282 


A 


387 


1 


382 


PTPTRTAWKK/ SOTWCWRGCGS IGTLR 
HCSWECKMAQLLW*TV/WTFPRKIRQPS 
DVCCDMVIGIiPYDPAVIiLLGICPREMKT 
YVHTADMSVITS VSLVI I XADSENlJIiNV 
PSADEWINEMWYIHTVDY 


382 


14283 


A 


388 


1 


341 


hklenleni vkfl * th/ tlprlnqekiq 
tlnrsitsskikslikntptrkekkk\n 
pgpkgfpreffpraVkkqgvptlgnppq 
KFRERGSLIiNYFYKPGLPRIPKLGKNPR
GKKK


383 


14284 


A 


389 


168 


1 


KKFFFFSFFF*GDRVLLCHPGWNAVMQT 
RLTAAS \TPGLKQSSHFTLPSTAGYTG 


384 


14285 


A 


390 


337 


1


TGITPIJSrPnEVXPLXNTSVLLASGVS IT 
*AHHSVIESIGNLI IQALLITIIiLGLYF 
TLVQASE*FESPFTISDGIYGST*LVAT 
. GFHGLRVIIGSTFLTICSIRQLIFHFT 


385 


14286 


A 


391 


1 


235 


IjNFSYESSMYFAIjFTIVF\WVFIi]SrF*KF 
FMN* I YHLCYVIi* YFLLV\PVCLLTWFM 
VFCFFFFFFFFFFFPFFFCFFFS 


386 


14287 


A 


392 


1 


258 


SCDRLFANHLSNKEIiVSRKYIYIF*KSQ 
DSTIRKQTDKK*AQEIjNRRFSTKDIjQMG 
NKHMKRCSTPLAI /REMQIKTMLRYHCI 
PIG 


387 


14288 


A 


393 


2 


317 


LAYCNIiCIjPGSSDPPTSS SRVAGNYRG\ 
HHDSVF*RAEDINMHEIQFISFLFLNRD 
EVSLCCTGWS * TPGLKRAS CLDLPKCWN 
YRHEPLCIAFFFNGSWFQCQI 


388 


14289 


A 


394 


91 


408 


LGAEFDVRAYLTSGRLrTGPGVPFLTGFy 
SKDHII\ETANISYTNA*ALSITIjIATS 
LTSAYSTRllhLTIjTGQPRFPTIjmiNE 
DNPTLLNP I KRLAAGSLFAGFIi 


389 


14290 


A 


395 


3 


165 


RNKKLKNNRHWRGC * E KGTL I HCCW WCK 
LVQPLWKAVW*VLR* /LKTELPFDPAIP 


390 


14291 


A 


396 


661 


1 


LCPLSSFYRKCABIiGFWFVCLFVFQIRV 
SLiCCPRWIiQTPGIiKGSSHLSLPISWDYR 
HLPPRLAIGAVFLIFVLMRTCLR*FADP 
ISIiSFE*QQRN*LHILISLL*iyiVDrCLT 
QPTKWIFL*PKNS* *LKRQCKLPKITEL 
/PKNRIQIYPTD^MSSIPHSLSFYLVCL 
F VCL F * ME S CF VT / TMAAVQWHDLGS \ L 
QPLPPGFKRFSCLSIiLRSWDYRRPSPC 


391 


14292 


A 


397 


320 


1


PDSKQQXFNVDETVFC/WKKMPSRTFLA 
REEKSMPGLKASKDRLILLVQANPAGGF 
KLKPVLTYH/SENPRAIKNYAKS \TVL* 
KVJNSKVWMTGHLFTALNVLSPRRAL 


392 


14293 


A 


398 


1 


158 


CIGPMWENRIiILGGRGCSEL*SCHCTPA 
VIAT\SKTLSQKEKKK*NMRKVM1QC 


393 


14294 


A 


399 


264 


1 


LINEFSQVAGYKINKQKSLVFLYTKY\Y 
SKLSEK* IKKAIPFTIPAKKKEIKYIiGI 
NLTKDVBI)XjYNENYKILKKIEDT\KKWA 
DTPC 


394 


14295 


A 


400 


3 


343 


HEQKRQSKVREVRELSQGLNMDRKRWSQ 
DLNPGCRTVALSPYHHTRIiALNCPGRQW 
FVGVNFF*RKKIFFFFKFWDRVIiLCHLG 
WSAWPSWLTAALISW\VK*SFRLGLiLS 
SW 


395 


14296 


A 


401 


1 


345 


GTRKNTDNTKCR*VCEETGYFIHCWWGY 
KMVYPLGKLVWHFLKKVSIHLPYGTSAL 
hShMlEKIiTFT/CHTKTCTQM/FBVVLF 
P I VKKWKQPKCLPVGEIiLNKIiWYIS IHT 
lYSAI 


396 


14297 


A 


402 


102 


355 


DRVTRLANFCIFGRDRVSPCCPSWARTP 
GIiKRSTSIiSLPKCWDHT*ATAPG/LRAI 
liliYFWDYRCIiSPDLVWFACEHFSVEDVA 
FF 


397 


14298 


A 


403 


156 


3 


NNKNGFP^SMLFQPSNGLVSI *NI \PnK 
*QKWFSFFPEERVSLFHPGWSAW*SWL 
TACSLDLPAQVILPPLSLPNSSC 


398 


14299 


A 


404 


1


364 


GTRSEVIYKLLLQTPWVl^HMRTRLLLQ 
ACGAR*MW!yiIL\MIILGLVTCRLTIYQ* 
WRDVTRESTYQGHHTPPVQKGIiRYGIML 
/FITS*WFLSCVI\WVFF 


399 


14300 


A 


405 


3 


365 


HEQTSHNI PLSQS P I * SKILMLUSTSMKA 
ERGEEAAEEKLEASRGWFMRFKKRSHVH 
S\KVQGEAANADVEATASYPEDLiVRTID 
EDGDTNKQI F\NVNKTAFCWKNMPSRTS 
TAREEKQCLP 


400 


14301 


A 


406 


3 


356 


HERSDQLYAlSr/ KLDNLDKRSKFL\EGYK 
liQKLTQKEIQNTNRPTTRI /E I SKKKKK 
KKKKKKKNFSPGGFTGE I PPTFKGQFKK 
IFKKFPKNLGGEKTIiPI * I YGAG/IKLL 
PKIKK/DPSKKK 


401 


14302 


A 


407 


116 


362 


YKY'SLTPQKLYimSYIKKiraNKQNNRWH 
STTTTTKQPPPPGFKRYSCLS FPLS * DY 
RCTPS CPVNX.F/CVFhVBTGFmiVG 


402 


14303 


A 


408 


3 


360 


HEVRLWDPAFERNEGGGENEEKVDWLNY 
RMWFHLIP*AECYSICRLHAYYSCIIjGP 
VLLLLFVLIILLFCCL/SFFDYF/VFMF 
FIFFYSFFFLLFSLiFFSLFL/CFIFLYF 
FIF/CFFLYFFFF 


403 


14304 


A 


409 


388 


1 


ALLKFPFPKVLKKPP*GSFFP*GF*LLS 
/LIFPPYFGRETLFFFF*NKVPLCPPDW 
SSMARSRVPAGSVFPVK\ESLSLSLLSV 
PPQVQVNGFLTFFFPFFSW*RQGLPLLP 
RMVWNSWAQAIIiPIiWPHAS 


404 


14305 


A 


410 


1 


386 


VFNAEESAY/YWEIMPQRKFIR/ESKQA 
P*FRQERIS*YCANAWFTIRTTHICKP 
ANPQALKKKKKKKHQLTVFWLYTKKS * T 
MRTLFLDWFCQCFVPKVRKYLASKTLPL 
KVLL ILDHAPGHIiKPHI FN 


405 


14306 


A 


411 


2 


417 


AHHIFTARIDVDALAYFTYVTIIIDIPT 
GDEPFS *LATLR * INMK*SGAVL*ALRL 
IFLFTVRGLTGIALPDL*LDIVL,HDTYY 
VG7mFHYGLSVGAVFAILGGFID*WPLF 
\SGYTLDRPYAKIHFTIIFIG/VDLAFF 

p 


406 


14307 


A 


412 


295 


397 


wqwpgt vahacnprtbgs * ggrv/ tlrs 
gvrdqp 


407 


14308 


A 


413 


437 


3 


PGFGSLIGNLIPASGNGRKSKSCVCVCV 
CVSVGFKSCr^SLCEHLFTCLCPQrCVR 
LQLMV/ CPQNCVCWCPESGLFLDES VCV 
RLCVCAWMAVWMGGSGSG*VCGCGC\MC 

ICVGPLLDSELCLCVC/LLGQSKCDCGL 
RCRLPAWCV 


408 


14309 


A 


414 


2 


392 


HLQFIFFWIIiKIFHYL/FLWFWFYIiTSM 
ALFyLQ*QKRNCMYEVFNRGi:iFF/SC!GB 
GVSGSPASSSSSPSCSSSTRGGGAVGGG 
GLGFVCFIiLLFWGFWFVFLFCWPLVCVG 
WVWCVWWVPFLLGV/ CCWCFFP 


409 


14310 


A 


415 


376 


1 


GFQASKDRB/ LLLG/ ATAAGDLKLKPML 
IYNSKNPRVLiKNFAKCTLPVLYRIiYR*K 
NYAWMTAHLLKSWFTEDVKSTIQ*KISF 
EMLLl IN3SrVPGHPRTPMEMYKELNFF\M 
PAJSrrS I P * PMDQG I VL 


410 


14311 


A 


416 


409 


2 


FFPIKKKKFAGALVFFFFFRKIFFFPPK 
GEGKRGVLVSLNIiLPLC*KDFFSPPPPG 
EGVLWPPPPGPIFFFFFLKKGFCLFCQG 
GFYKTPSLYLIPIiSPFKIWG^TRGFT/liP 
PPALFPYFFFF*DRVSLCHPGWNA 


411


14312 


A 


417 


425 


3 


RELLAPWQNFKLRPQGPFFSP \GPGGEK 
KR*SLPKTPPKGGPPGP/SPNGRI.PSGG 
QLPPPRGG/SPQGPPPLPKGPQGWGPPP 
P*RSPPCPPHRIGVPQVSSSPTPGLVFP 

RGPLNPGNLGGTKKKKKISSKAARDLEL 

VRTRG 


412 


14313 


A 


418 


1 


346 


L.LPDRNLNTTFFD\PAGGGDLILYRHLP 
* IFGHPEGYMLMLPGFGI ILHIVTYYSG 
KKEPFGYIGMV*AMISIGFLGFrV*AHH 
XFTV* IDVYTRAYFTSATI I lAIPTGVK 
VFS 


413 


14314 


A 


419 


2 


382 


LFSTNHTDIGTLYIi]:iFGA*AGVLGAALS 
LLIRAELGQPGNIiLGNDHIYNVIGTAHA 
FVINFFIVIPI 1 1 \GGSGN* LLPLI IGA 
PDMAI ARINNI SFRLLP PSLLLLLAS Al 
VEV* SI^TG* 1 1 YSSL 


414 


14315 


A 


420 


261 


378 


KKNFFFF*KLNFP*KFFLIPFPPKKKIF 
FKKKKKFFFYKIFF/I*KNIFFSPQKNI 
*PFLFFFI*YPFFFFFFFFFFFFFFFFF 

FFFFFFFFFFFFFFFFFFFFFFFFPTIiN 
SLYFSGRVGGRV 


415 


14316 


A 


421 


124 


428 


KPAATHACAT I FMCLDQEAI I SN* H* AT 
TQTTE VSL \ S FKLD YFS 1 1 P I P VALF VT 
WS1IEFSL*YINSDPNINQPFKYIjLIFL 
ITILILVTANKLFQIiFIG 


416 


14317 


A 


422 


8 


355 


pvfsynhstlltfslslsffffffapgk: 
kgekngkpgpfrggffpkkikkpfppp/ 
ggpkkppgpgfpqkphfpfkggpwpskt 
lfgrggkpnfggpqkpplv*nppsqgp/ 

HGPSGGG 


417 


14318 


A 


423 


356 


3 


KPLGIDLiTKKVKKLSKKNYKTLMKKIED 
DSN*KDNSCL*IR/iyiTILPKAIYRFNTI 
S VN I P MRFLiPDAWADAW 


418 


14319 


A 


424 


2 


338 


PSVRLGGHWPRSGITPLNPVE/VGLLNT 
1R1.LASGV3XTGAIIHRLIECNRKQ1XQA 
LLITILLGIiY\LTIiLQAS*YFQAPFTIS 
DG I YG* TFFGSTGFQGIjHVI IGSTFLT I 
CF 


419 


14320 


A 


425 


55 


342 


GPFTP WS LC* GDLQR* P / RAVKFFLKKK 
KKKKKGNP IRN*RDISS* FLKNLETAVR 
NFWTFFSYFKSKKLIQKSDHPLKNVKKI 
LQNDKRYLKLDW6 


420 


14321 


A 


426 


2 


367 


drrrfctyhkdigalylllga*agwct 
avslliraelgqpgnXllgndhvynviv 
sahafvimff\meyp 1 inggfgn* lvpl 

ISGAPDMECSGINNISPWLLPPYVLLLL 
ASAIVEAGSRT 


421 


14322 


A 


427 


359 


74 


IGADYTRKPPYiyiGAPIEFSCATCVL*IiT 
STQRECVCVCVCVCVCVCVSM*VALIPD 
SN/CIPLSAYVCVCVCVCVCVCVCVSI * 
VALLLDSNSYH 


422 


14323 


A 


428 


52 


361 


NYPQLSEI * * *VS*LHY/LFSFLPFYFF 
*IxVSRFVARI,ECSGAWH\ I IAHCSLDIjL 
GSS / DS S PAS AS L VAGTTGMCHQNQ 1 1 F 
LFKKVFVGRVQWLTSVIPATWE 


423 


14324


A 


429 


105 


361 


SRLFFFFPFFKTNFGFVPKVGVK/WAFF 
GLMEPFAFRVK* FSGLTHPRTONYiySIVP 
HCPVULEF* VKTGFNLiVDKAGPKLLT* K 
DFP 


424 


14325 


A 


430 


1 


379 


HAYHIVKPSP*PLiTGALSALLMTCGLAM 
* *RLHS ITLLILCLLTNTL \S I YQ* WRD 
VSRESTyQGHETPPVQKGL*yWIVLFIT 

SEGVFFAGIF*AFYHSSLAPTPQLGGHW 
PTTGITPLNALEGPIj 


425 


14326


A 


431 


1 


390 


KCTKKWKNIilFMIGRlJSDSJWMSVIjHKVI 
YKFRAICIjR/TFHFHRKKKNLTWA*KHK 
KALIPKTILKKNGKSGGITLSDYKM*YK 
TTITTTIRFWWKP/DrNKYNKITTHEIY 
FHIYGQMYFFKIAKTPQGIK 


426 


14327 


A 


432 


1 


392 


TRTRGRTQWD* T* LRPI / WTKKKKKKKK 
GKKRKKKKGVFFFY FFLGF* FFL WFFF 
FFFFLGFFFFFWFFFFFFFFFFFWGI FF 
FFFLLIIYNFFCKF*PFFFFFGFPNFFF 
FFPWFFPPLDFFFYFPFPRV 


427 


14328 


A 


433 


382 


1 


RGGKFQWQKQKRIOrSPPPPKKGFFPPAA 
PKNIKGGGKK*TPPKKGGVS\PPPPKKR 
KSPPHKK/IGNFFPPPRGKGGPPBCNPKK 
PGPPFFFFKNPPPFFFFPFFFFLFFFFA 
HKDGLLAREQTQAEVKT 


428 


14329 


A 


434 


375 


1 


KYSQL I FDKTAKAIK* SKDCLLKE WFWN 
KWTSTHKSLDL/SPFTKINSK* ITGLNL 
KCRI ITLLENNIEENIiDRLGFGNDFLTT 
liPEAQSKTELISY/DFIQTNFCSJyCTATV 
K*MKRQATEWEKI F 


429 


14330 


A 


435 


1 


459 


PTRPPTRPSTRTLGFTMLAKLGSNARPY 
/DYPSYSASHSAGITCVKHCARAVIHDF 
NGVHRPLLIW*EFI>VEISFIFDFFLETN 
PGFITKGELQGHNLGSLQVSPPGLTLFS 
CLSLQKSGFYG/HXiAKYLVKF/CFGIFS 
KRGF*GGYPGGAQSPPP 


430 


14331 


A 


436 


14 


382 


MARNTS QKDIRI D PNNKCLWLVKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRKNYFHVI \LKNKAVNNGSTITS 
TNVMS PGLHIRL 1 1 1 LS LM I YKKS TTE * 
VEKQTCYYILM 


431 


14332 


A 


437 


3 


192 


TPGLK*SPCLGLPKCWDYRR/AAAVPGL 
*DIIiAIFFHHILSESCLPLLVFIHCFNA 

NLFLRWSS 


432 


14333 


A 


438 


1 


364 


DRIALGT VDDL PGRPTRP \AHP 1 1 LKVS 
LADRDAI *NI,WQIPIVAS*Y1PLGF/WS 
KAMPCSVDIYSSFEKKKKTVFFDRGQGP 

TPEIQVLGKAEGVKPFEPRGSKPSWRIM 
AKPKFYKRFKL 


433 


14334 


A 


439 


336 


2 


GMIPEMKGNAGP / SG/ PRKPGFFLG* KG 
KVKAWLNFKPPPRAFWPBCKKILLPLGPS 
GILGPGPKGNLSRPGPPPQAPPLGFCQV 
LGP/SGPGVGFPKPGQIGWSNNPCLTLQ 
KNK 


434 


14335 


A 


440 


1 


333 


HKLENLENI VKFL * TH/ TLPRLNQEKIQ 
TLNRSITSSKIKSLVKNTPTRKEK/SKK 
PLVPKGFTIEPSQLCKKIiRVPVFMKPFQ 
KYGERGLFLNGFYEPGLTLIPKFGKNPG 
G 


435 


14336 


A 


441 


320 


1


WGINNWLSP*EKKGRPFLL\SHPKINSK 
WIKDLNVKGKI *KPriEGQIG* YLMIFGV 
SKGLFKGGPKYTLYKRKY*NRELLFIKR 
YFS LTEKS PAE I CHI KKLAKKK 


436 


14337 


A 


442 


8 


391 


ERI LRHDQVGI I PGM/ QGWVRI * ESVDE 
I /QHV/NQIiPKITHTITSVS VGKARDKF 
NI YS * FKLGKTEKERNFLHWIKS ICTS P 

AAS ILLEGKES PF \ PFRSGTRKD/CLL * 
P / LG S GM VLE ALGTAERRTRKE K 


437 


14338 


A 


443 


363 


2 


HHVGQACIiELLTSGDPSNLASQSAEITG 
MSYRAQLSIVTFSAYFLW*KLSHARGL 
ML*QS 1 1 1/HVLHFHQVKKAFHVS/PEN 
SQPLHNVBPEDWI F * EHQRKTVPPIHTA 
RKLQDLEPGLV 


438 


14339 


A 


444 


1 


900 


DSS AG I T / GI CHHAQI I / LF VF WETGF 
HHVGQAGIiELLTSGDPPASASHTGGDYR 
HEPPLLASLSFLNKEIjCTWPERRRKP I F 
SLPKLDPNKRKFSFPP*LF/IGS*TLIW 
SLPCFRNVCTQIiADPTKSIAYQSSLMKP 
FGQKS I CFGSEKFPDKVYLFSSDRKAKE 
QVLVWWVEY* I *DMSACFQ*ALGFCY 
CXjS IWN/ YELPEKFEVIACSLPSRNNDL 
ILSLKKKSQNSFFVCVFFFFKTKSNFVP 
QAEGQGPLFG * LKIiPIiPGFRNI S CIiNIiP 
GSC\QTGAVPPPPVNFGFLKKNRVSP 


439 


14340 


A 


445 


3 


346 


QIGKVKKLNK*VPHEPSKNiCL\FLEASS 
LILClOTSrBPFLSRIVTWDENWILYDNH* 
QPAQLLD*EAPKPNLHQKKKKKKKGLAP 
FWGAFSRGNPFYFFNPQQKLHF * KFSSQ 
KRG 


440 


14341 


A 


446 


2 


246 


FKCGKTRALMHCWWEGKMLRQLXRKVWQ 
LLAMLNMELPRE PAAPLLS I S PRERK/ S 
TTAKTCA*MCrATVCITVKKWKHRLLG 


441 


14342 


A 


447 


38 


393 


VILHRQGLSL/VTQAEVQWYDYNSL*P* 
TPGLKQS S CLS FLKS YCHG* LFWVWF 
KWMX GFTMLPGL I LNS WPQ / CNP PAVDA 
QI AGIRGFHS / VGQAGVQWHDLS SliQPIi 
PPGFKQFSHLS 


442 


14343 


A 


448 


54 


540 


RIPFEHASGFLQSSHQKPHCLLHPLSGQ 
VSSDGQ\FRKFGISRLGNSGIYKAAFPL 
HDCKFRRQSEDPSCPNERYLLYREWAHP 
RSIY*KQPLDLIMICYYGEKIGIYFAWLG 
YYTQMLLLAAWGVACFLYGYLISrQDNCT 
W S I E VCHPD I GGM 1 1 MS AQRDRL 


443 


14344 


A 


449 


2 


310 


FFFLRQSL/DSVAWAGVQWRDLGSLQPP 
PPGFK* FSCLSLPSS WNYRHAPARPANF 
FLiYF* *RQGVTMLTRMVIiIS*PRDLPSS 
ASQSAGITGVSHHAWAKISIi 


444 


14345 


A 


450 


2 


466 


KQKI FS VDSTAFYWKKMPSKTFHS * REG 
TASKLHTAG* iLIiGPNAAGDLKLKPVFG 
DDSGNLRALKNYAESPLPVLYKWNTTKAW 
MTAHLFTAWFTEYFKPSLRS \FRKISWK 
I *LiFMDNGPHHPRALMEMCKE/NAVFMP 
ANTTS ILQPMDKGVILiI 


445 


14346 


A 


451 


670 


212 


SSSSP*GS*YQ]SrRTNTIPSPKKKQTKK\ 
KNWHYKSIFFTDAEILSWILANAIQQSI 
KRFIRHEQLGFILWLGWVNIQTPTNAT 
CYTNRLKKRNHMTILINAEVLYKIQPSV 
I*KLFPLRGAF 


446 


14347 


A 


452 


407 


219 


PL I S LRWENHLGPGVQGCSEP * LCLCTP 
AWiyiTE*YPISPSQNKTKKQ\HTRKQNQH 
KNKCVKN 


447 


14348 


A 


453 


2 


395 


WFLWFRERSHLH/ RVQNEAASADVEAAA 
SYPEDLAKIIDAG\AKQIVSVEETAFHW 
KKVPSRTFITREKTTSALKG/RC*LSLV 
DNAAGHF*VEA\MLIYHSDNPRTL/KN/ 
YAESTIjPVLYKWNNKAWMTACIjFTS 


448 


14349 


A 


454 


424 


38 


eetepii/hrpissaevelvikniipt/kh 
ksprpdgfi aefykmqke * lvp i llklc 
qkikeegllpnsfyeasisliqVnsgrd 
tv/rkenfrpifpmnifakilnkiianc 
iqqh1nkdihhyqvl.fsslhsr 


449 


14350 


A 


455 


2 


309 


PRVRSQTPGHKRSTCLGLPKCWDYRHBP 
LHPATASFIjVAAVPGM/FADPP/CNMHIi 

ne*mne*mngddaseilsfemrshsvtq 
tggqwcshsspqpqppgppk 


450 


14351 


A 


456 


3 


441 


dawglvldrerpffifffffflgnggyf 
rgpggrpggegppmetsvs pvkkilrgp 
gqgggicarkphplggprgpnhktgg*kr 
ayptgenpvltekpkftgpggngpetpv 

igkagagkplnpg\mggsrnpkwahcp* 

TGGKKGK 


451 


14352 


A 


457 


1 


234 


ptrpnhlglgdrgcreprs shgtpawat 
erdwvskkkticg/ekf*kgrtk*ms*ve 
qggsggkgiftlaisnsplpff 


452 


14353 


A 


458 


42 


470 


kripqlkksplplknppgewvgkinfpp 

PRERPKNFF*KKKTLKTPPKQKFLiRKKS 
P FKKPHFLLYS KAT KKQKRGKKKRAP P P 
KKKKKKENPKIVIFRTEITVASPVLSWA 
VKPIIHIFFPREKKPHA\KPPPP 


453 


14354 


A 


459 


2 


393 


DRP I EQWNKRERPR INLY I DGQM I FS KG 
TTAFE/WK*ISrLCNKYVWNAWIC\NRRKN 
PYhTSSRKINLR * I XDIKVK/PKAIKLP 
EQNI RCSL * VLGVGRDFLENMNYTRKK\ 
I GKLD F I KI SQLGMVADTSNP I 


454 


14355


A 


460 


338 


33 


GDEKIEKSFFFPISSLCFVP/AYL*QQI 
RLLI FTRNSRLLF I Y / CLFLEMGSHFVT 
QTGVQWCHHSTLQPRPLGLKQSSHLSLL 
SSWDYRHLPWLKWTEACAV 


455 


14356 


A 


461 


77 


435 


AKEVEASLSTLARPISIQQQQQQQQNEK 
KNKN* PRIAKAI \LS * KGEITLP/ELQL 
CYRAMITKTAWY*HKNRHIDQWNRRBNP 

ETNPHT YSELI FDKGPKS I \ SLFNK* CW 
EYWIFICTR 


456 


14357 


A 


462 


157 


3 


NGRVDLKIQKLARCGGACLQSQL/RQEN 
HIiNPGEKGCSES*LHHCTPD*VTKQ 


457 


14358 


A 


463 


363 


3 


PSVAQAGVQWCHHSSLQPQPPRLR\HPP 
ASA/SQ*LGLQGQATAr,yiHKYILFPCN 
VLISF I FPPPFLEMRSHSV/ SQARVQWH 
DGSPQPPPPELKRFSWEAELAVSRDRAT 
ALQPRRQ 


458 


14359 


A 


464 


3 


396 


LKEIAEKVKKIIKFKKQLRL*MKKI INR 
YLKEN/LNQLLEIKGTLRELQNAVESFN 
NRI,*QIEEGISKI,EDKAF*WTESGKIRK 
INK* TK* TDYI KEPNLRI ID VPGGEEKA 
KSLKNLFSVIIEENFPGLAR 


459 


14360 


A 


465 


399 


1


PPGVrtLNGPPFFFFFLILGPGWWGQPL 
KTPLVFFPNFQYHIFPFKISRIF/LPTR 
FPPQVPPLCP*PFSSPWGLKWFILLSF 
FFFI)*DRVSLCHPGWNAVM*SQLTAASN 
TW\VK*SSHLSLLSS*DYRCVLS 


460 


14361 


A 


466 


279 


1 


TTNIFNPRRVGSTDAGPLNMKEKEKEJSrV 
KRRQATGWEKI VAKDTSDKGLLS KI YEE 
L]:»KLKlSrEKTWK*/incrjCWSKGFNRyFA 
KEYIQMANK 


461 


14362 


A 


467 


2 


436 


RGELPQLDKYLKKTTAIILNGEEIi/E/A 
CPLRSRTRQGCSLSPLL*KKFFPPKSHI 
ESLLFFNI I LEVLYSAVRQENE I KI VE I 
GKEEMKMYLFTDGNHLCKNSERIRN/YS 
KVAGYKVNV*T* ITFLYTSREQVEFEIK 
KTLSMWRFF 


462 


14363 


A 


468 


2 


420 


RTTALP*AVRQGRLSL.QRLLLSFC\CLC 
PAARGEAYIG\RQASMSRCGLQAVQAS C 
LLCLPKQAWAMAGAPPSASLLPCSLISD 
RCASSQPDSVGVGPSEAGVGYNLWRGIi 
IjSRSEKR\NIRLGVTRFSRCV/LSPLSIi 
TRR 


463 


14364 


A 


469 


1 


416 


PSP*PLTGAIjSALLKTCGLAM* FHFHS I 
TLGILCLLTISrTLTIYQ*WRDVTRESTYQ 
GHHT\QAVQKG\LRYGIILFITSEGSFF 
AGLV*AFYDSRLPPTLQLRGHWPSTGIT 
PLN\HLQVAVLGEPVLIiAS * VS IT* ADH 


464 


14365 


A 


470 


211 


398 


IFFFPKMGGGLSIILCWRRKRHPQKLK* 
STCLGLPKWWEYRCEPP/VPGQKKNFRP 
* KTKE S P PLiVSGGAS S ASN I KL * I AL.P P 
FLEKKKFKKQGFKDPPFPLFFTQNPGQG 
NFFFGKI * YGPP I PFFFLRRS FTLVAQA 
G VQWHDLGSLQPP P TGFKRF 


465 


14366 


A 


471 


62 


424 


TLMHCWWKWKMVHIiLWKTVLNLLIKIN/ 
RTLNHT* PCNPAIPLLGVSLREMITYVH 
KKPCP * MF I VALFMVTKNSKHLKCSIiTC 
KWINKLWSLYTMKYYLATKRKELLWHNR 
SCRNYSSDMS 


466 


14367 


A 


472 


427 


20 


LGLRLPFVEQEENKPYYPLAPFSPPSPP 
MSLYKNSPIYPGKGFLFPSKNNKIPPLN 
FLKMSNLF* /FALLLGNKDFHLLLLGGP 
IGIMGDHiyiATTSOMFYSRVP I FF/CFFE 
TGSHSVAQAGVQWCNLSLLQSLPPRLK 


467 


14368 


A 


473 


424 


98 


NWYLYYYYYYHLLF^RQGIi/DSAAQAGV 
QWHHHCSLQPRRHRLKQILPPQLIi\TSW 

NYRHKP PRLS P YVAQAGL * LLGS SDLPD 
SVSQSAGIIGNEPLHLASL*FLFAFP 


468 


14369 


A 


474 


415 


75 


NHFLKFFPQTQANGPGGKNKFFFLKTRF 
CFFPPGKRPWAYYKSLQPPNSGGQQI \S 
APTP*IKGAPKGGPPTRVKFFFFFLF/L 
RWSLRSVAQAGVQWHDLGSLQAPPPGFM 
PES 


469 


14370 


A 


475 


3 


413 


PVQKGLRYGI \ILCITSEV\LFFAGFF* 
AFYHSSLAPTPQLGGHWPPTSITPLDPL 
EGPLLITS VIiLASGVS IT* AHHSLIENX 
NRDQI IQALLilTILLGLYIiTLLQA+EYF 
ESPPTISDGMYGSSFPVATGFHGLHVI 


470 


14371 


A 


476 


1 


440 


ITLTDRELSYY*A3SrRLL*LAYTITFIV* 
I PLYGLHL* L\PKAHVEAP I AGS IVLAA 
VLLKLGGFGVIRLTLILNPLTKHXAYPL 
LVLSL*GIIITSSICIjRHKDLKSIjIAYS 
S I SHIALVVTAILIQ'TP*SFTGAVILI I 
AHGLTSS 


471 


14372 


A 


477 


2 


397 


L,FVYNHIDVGALYLLFGA*TGVLGTGLS 
LLIRAELGHPGNLLGNDHIYNVIVTAHA 
FVIILFIEIPIIIGGFG\N*LVPLIIGA 
TDMAWPRimiSL^LIiPPYLLLLLASAI 
AEDGAGTGGTDYPPLTGNYS 


472 


14373 


A 


478 


442 


44 


S SSS P FTPPRGKFFFKKTPRKKF FS / S P 
G]SrKGFFSPIiSP*KFFFFFNPFFFFGGFF 
PNPPPPKKNFFPKNSPGFFFPPPLKKKI 
FFFPPPLNPAPPKVPFKSPPPFFFFFFF 
FFFFPFFFPLRGHPWGREGI 


473 


14374 


A 


479 


1 


439 


PTRSPTRPLiVLDRERPPFFFFFFKKGPP 
S VPPAGGGGANLG* GNPPPL/GVKKFFG 
PKPPKIGE*RPWPPPPGQPECSFFIiKKWr 
GLPHGGQGR*N\PPPRP 


474 


14375 


A 


480 


420 


0 


YSPPSPPK/YRAPGKKFF*KKPRKEKF* 
KKKILGFFPPLSPLKFFPFPKAFKFFGG 
VGPNCPPPKKRFFSKNSPGGFIKPPIjKG 
KNFFFPAPVKFGPPRGFFKGPPSFFFFF 


475 


14376 


A 


481 


371 


2 


NKI*NKFSKLNSKKSSSSSSSSSSSSSS 
SSSSSSSSSPSVNK*IICKCSTSLTISSS 
SSSSSSSSSPT*IAKIKNIDSPKCWQGC 
RETGLTV\LVQPLWWQFLLKLSMQri*D 
SANMILSMHARG 


476 


14377 


A 


482 


3 


335 


HASGKDRHTDQRNRIKNPETDT* lYSTF 
*QKC/RLI*WRKDSLVNKWC*SNWA\SP 
iyiKKIKLDIiSSSSSSSSSSSSSSSSV*NV 
KliLGNNVGGNLQYRGLRVHTVDVKAQHI 


477 


14378 


A 


483 


1 


418 


GVR*FSPLNPPSRWGPKHGPPNL\LNFF 
FFFFVFLVETRFHPVGQAGLELLGSRAP 
PASA/FPK 


478 


14379 


A 


484 


1 


356 


FCANAVRSM I KT VLI DKATKP / RAWKGN 
YKYHLPVFNCKTGRT\DSGNPLNWFYQC 
FVPEIRKYIARVGLPFMVFLILDNAPGH 
PEPHEPNTEGFRWFLTPDMPLIQPLEQ 
GVLRTLTA*YQCFVPEIRKYLARVGLiPF 
NVFLILDNAPGHPEPHEPNTEGFRWFL 
TPDMPIilQPLEQGVLRTLTA 


479 


14380 


A 


485 


166 


406 


F I FWGS VS S S AEGGGWS WS S LPRVTVR 
PDETVDVTIHLLKE * CRPGAVAHACRPS 
TLGGRGGRI/TLRSGVQDQPSQHG 


480 


14381 


A 


486 


107 


429 


FWVTQTFGFFWLNPPGGLELWPPPPCPG 
NFGGNFKKKGGPPLWPGGVQTPQPRGIT 

PPGPPRGGK*RGGPP/SPGPGFLGKPKG 
GGP 


481 


14382 


A 


487 


399 


2 


GS PAPGLPKGWGFRGGPPGPGKFVFFKL 
PKGGPPGAEIPLGEFFQRGiCKGLGPrPP 
PVFLKTEEGGTLS/NSPFKGRVPLVPKP 
KAGPKKKKKKTNYKP I YLNNTDTKFPNN 
MIiAlvIRI *QCIKEVMQYDRTRG 


482 


14383 


A 


488 


419 


2 


PPPPPPPREKKGGLYFNPKIFLGKSFPP 
RNPKDPS*KPRVGIFFEGPKPPKKKKF* 
FGPWGKPPGDG*RKN'PPK\PFFFFFPFF 
* DGVLLGRPGWI AMAQSRPGLQE WHS I S 














KKRSGPCVLGISSKEVPDA¥!ADAWADAW 


483 


14384 


A 


489 


629 


0 


SSSSR*SLLCKCFKN*GEKIRNPIEKWA 
KVMHRLFTKRCINI *KDVRSTSIjMVRET 
* ITTTIiRYHFSPIRIiSKT*KIjNRTLyG* 

DCGETGIFIHCWWECKMF*VL*RE/VWQ 
YLIKLL/LHIPVDTA 


484 


14385 


A 


490 


2 


351 


KNRPEMDPGM*GY*VNDEGDTTYQQGNE 
EliRS /WCWDDNRLAKVKLDLYFPPHITI 
EINSKWI *DIiNIKNPPIHVLEENMN/I * 
F*HV6IGKHYLTI*IiKI*NP*K**IWLH 
ENKMYCL 


485 


14386 


A 


491 


42 


507 


NLAK* IQ* * IQTIMHHDQVSFIPGIQGW 
LNILKS INIIYYTNRIi*DKTHITISIDA 
EKAFDKI*yLFMKGK\KPLSQLGIEGlSr/ 
YLKLIKGIH 


486 


14387 


A 


492 


18 


417 


REGKKSRVHHFNIKQ6RVMSTKH*KTQL 
/SHSNAHSNKAQISKSQPHGLHDPFKKK 
KKKKKKKKKKKKRGEKKKKKKRGKKKKK 
KKKRGGGPP*KGVREAriFCFFWRKKFFF 
VGGGGGKTPLGCLQADTPtiWGA 


487 


14388 


A 


493 


413 


82 


FFSSPPPPIiPFFFPHLFPPPPNFFFFFS 
PPPFFFF/PPP*KTKFPPPPPIiFFPPPP 
LFFFSPPPPPFFFFFSPPPPFFFFFFFF 
FFFFFFFPFFFFFFFFFFFFFFFPFWL 


488 


14389 


A 


494 


1 


413 


PTRPPTRPPTRPPTRPVLDREHSPSNLP 
KKKKKKKKKKKKKKKKKKGGGPLKKKPG 
GAKKKG /EKKKKKFP * KKGEKKKP PGKF 
GKKKKI WGGEKRAKTPQKKKTP *GKKKI 
LKGEGGKKNPKPRGGKKFFSGEKKKKK 


489 


14390 


A 


495 


274 


1 


lYRIDCAYMKKVERSKISILSFHIRKIiE 
N/ QLNP K* /RRRRE I KIGAE INE I ENRK 
* lEKINETKSYFXWKISKPIiAKLIKEKT 
QITNTRNRAY 


490 


14391 


A 


496 


336 


1 


VFQYTYNKIiVSMFYYCFFL^RWGLTVLL 
RLVSNSG\PSD\LPSSAS *VAGTTDAHR 
HTQLCFTTIiSDLLEPYFHLGSFIFSLn* 
FFFLNTESHSWQVGV*WHNLGSLQPLP 


491 


14392 


A 


497 


445 


382 


PRVLIFFFLTPRGPPPPPPPNKVFFPPP 
PPQNFFFPPPPPPSWGGFAPK/PPPPPK 
SFFPPKPPPVFFSPPPKEKKFSFPPPPH 
FAPPPVFF*PPPPQ 


492 


14393 


A 


498 


430 


2 


SPPPKPTRM* \KVKFGR*KKGGGGGGGS 
PPLSPPLWGSKPWP*VPKGR/PI,PG*A 
GKPPFFLKFQPFPRPGGGPRCPPLFGGL 
GOKNGFTPEWLLIi* PKFSPFPSSLVDR 

RIYCLLKMLSYKSYMMILLFASCLNLYT 

FPIV 


493 


14394 


A 


499 


363 


3 


KKLVTPARVLGDI I PRNRFLQMPQQREN 
PLCQVWMTKPPTTIPVKTKTGKWYLISIj 
KRFCPAKKKT/ IKILVNRVNRQPPEWEK 
IPAJSfYASDKGLISSIiYKDIiKQVYKRKJJN 
PITK*VKGMY 


494 


14395 


A 


500 


145 


435 


VFMCINSSFLLFIPWYEYTTVWPVTCRR 
T*MFIAAIiFVIARNWN*SRCPSTGNW/L 
KLWYIHTME 


495 


14396 


A 


501 


162 


1 


FYNYTIIFIYLFXFIFLRRNL/DSWHT 
GV*WRHLGSLQPLPPKFKPFPRXiSLP 


496 


14397 


A 


502 


343 


2 


IKKFRWGPKPPLKEPPGV/SPDFPII/N 
FGNPGPVPGKDFKFPNFPKKKKWGGRAP 
PWPTTPGG*MGRFP*PPAFGVPRGGDG 
SPPPGAPKEGPLSKKKKKENEKEKERKE 
KSARL 


497 


14398 


A 


503 


23 


405 


KGRWYLWEKKKKNMLVRGKIGGGPPPRI 
PKF*K.VIFKTPPGPPFFFFGPPIiNFFFF 
KTDAPIiFF*NHPPKSKIWAIiAPPKKKIP 
LNPKKPPPFFFPPHFFKKNFPRAFFKNF 
\AFSPRGGASPSPPP 


498 


14399 


A 


504 


418 


247 


PPKWGP\FPKPPRGFFFPPPKGKKFFFP 
PPG*FGPPQGFF*RPPPLFFFFPFFFFF 


499 


14400 


A 


505 


26 


426 


GCTGLIiHS *MyAKVVCCTDHPir*VIiSP 
AFISRISYSPTHPQIiILPF*IFFLFLVE 
TESCHIAQAGLK/LPASSSLPVWASQSA 


500 


14401 


A 


506 


337 


2 


IPNLKESA/CPNLPKGWGFKD*PPPPSQ 
I SL I FKYPKFLKFE I *KKKPPLK:KGI.FW 
WV*RKKAPVGF*G\EMALGEKFFFFFFF 
FFFFFFFPFPRRSFAFVAQAGVQWRDLG 

s 


501 


14402 


A 


507 


454 


2 


TSKTGQPGRRGSPFPI*WAAGQKRSSPP 
RRGSRAEAhLTSQTGNPGRDTPHFPDDG 
RPGRGAPLFPDGAAGQRRSSLPRRG/VP 
GRGAP \QSQMGCQLGRGAPHFPDGVAGQ 
RCFPFPRWGSQVEALPTSQMGQPDRGTP 
HIPDGVAAGQRR 


502 


14403 


A 


508 


390 


13 


RIPPPKSRWKGKGPF*VS*NPRPQI*KN 
FriPPPP*K:/YGDPRGPPPPPIKFLiPLKK 
KGAP P I CPGCFE I PAPRES PPLAPPKS * 
NSRGNP PP PP F FKKKNPLFWGGKTKKLK 
I FFFFFEAGSC YVA 


503 


14404 


A 


509 


2 


282 


WQFLTKLNILPYEPVIMLCIYPNESKT* 
lYTKTYA* ILLAALFIMAKTWKQGVLKK 
V\TDKVWHIHMTEY 


504 


14405 


A 


510 


400 


250 


LLSVTQAGVQWHDHSLLQPQTPRLiKXHP 
PTSAS * VAGTIGAYHHAWI IFFF\ IFI Y 
CRDAVLLYWLGWFPKGLLKCWDYWCEP / 
RMSGLFVFWSDLGFRSEVS * ILSSWDYR 
CVPPCLDNFFF 


505 


14406 


A 


511 


99 


242 


VTKEKEGBFXRIK^hlHQEDTTTmXGV 
PNNRTLKCMKQKLTEIiKKE I 


506 


14407 


A 


512 


139 


374 


SLWGKRI FFFFGADPHFVPQAGGA\WGN 

HG*LQPTPVGLKKSSLLTFPISWYYRLG 
TPPFA^TFKIFCKNGVKPCCPACF 


507 


14408 


A 


513 


23 


402 


STCLRIiP1CCWDYRRESPLPALCMIiFIj*M 
KGKYVYMGFFI XlFFFLFLHKI * LLQEN 
FVyVCIiKKWKTia:iICA*TKKQKKNErG*N 
ILVDLNVLS *KKWKC3:iIiWGFIIiFYF/NF 
LRQSL/NSVAQAGVQW 


508 


14409 


A 


514 


7 


386 


FFVFSTHITLLFLFLFFFFLFPRPPGFF 
LA6KKNTPPAPPEKPPPP\PPKKKRGPP 
FFFFGPPQKKPPNSPRGGRGPPPPRPPR 
KMEKGGPPPPREIPPPPFFF*PPKTPPP 
PPGGGGGGKKTPPKR 


509 


14410 


A 


515 


370 


2 


FVFFQWLPE*NSFIiFSPF*KGL.C*RDL 
AFKRFFP WPPNKKLWPSL F*GFLKUTS * K 
FNFSLKSLE VPFFF\ CPPKFFPW I * AFP 
RFFPFFPPPP*RG*IFFF\*DGVLLCHP
GWSAWQFWLT 


510 


14411 


A 


516 


1


163 


NTREKhSGGGGTQl.S * /LhGRPRQKNHh 
NLGGGGCSEPRS\HCIPAWiyiTE*DPVSK 


511 


14412 


A 


517 


1 


390 


RIVRVYYKHLC/Y*GSWPREKIDISrFLD 
TFNLSRLNQEEIETLSTPISSSIIETVI 
KSPPT\KQKSPGPE 


512 


14413 


A 


518 


47 


312 


E YTGL IliRL YIE / LFTTQH* KQKALNQA 
K*IiAKGLNKHFIKRGayiNDPKHMKKSSA 
SLAIRKM* IKTMMRCHYILT*iyiAKIENK 
TESTE P 


513 


14414 


A 


519 


320 


1 


QNPINKQ*KKNNKKYRLV*HKKPPPFTT 
. QPHQSNQHQ\PN**PQKHT*TKNHHTPP 
PPP*SPPPPSLSPSPPPPPSPPSSSHPP 
LLSPSPPPPSSPPSSPPPRTSP 


514 


14415 


A 


520 


1 


368 


LKTQQEANNLILKWAKDLNRYLIQK/VY 
RNVKHTKRCPT S \ RELQ I KTRY Y YI SIR 
*RVICYKKRDNTKC* *GRETTGTLVIHYS 
*EYKMV*PLWETV*QFLTKL/E*SY/P* 
DPAITLLGIYSKELK 


515 


14416 


A 


521 


2 


264 


GKKIPMLYFAEIEKCILKFV\KRS*KAK 
* ILPKNNKAGGLKFPDFKTYYKAAASKH 
QVGGLWKELRSTLQAQQTWVPREELRST 
SPS 


516 


14417 


A 


522 


1 


408 


liEKMSTSIAI/RVMQSKTTMKCHYIPIR 
KAKI*NNDNIKCWQ*CRETGPLIYCC*E 
CT/LVQPL*KTV*QFLIKVNM\DPAFVL 
I FI PKK* KHMFTHKKNCKHTFRAALFVTVI 
ANTRI I PNI FQF VE WliNKLSYKHI VDY 


517 


14418 


A 


523 


320 


3 


TYIYIKTCPRMFT/AVMLTITKKQKQPK 
YPPTDE*NV\YIHSNAYYSEMKRNEVLI 
HATTV^MNPKNIMLNKASSRKRPKT1*LR 
I YEMSRI IQYMVLCNCLLLPSMF 


518 


14419 


A 


524 


426 


9 


AELPASPTPGTGTPQPLGSGRDQVP\GA 
VGGTHPGGSGLAGSPP*GGLGMAGCKS * 
ALPRGEVTEAWRQFKCGERRQASSTGGP 
GAPSAAAGPGAKPIiTAWGQRHQPAAPSA 
GPAEPLPTQNWCWPASNPGSRTRLSLHT 


519 


14420 


A 


525 


13 


461 


IGIWRKMNILLPYTKLNSRWIADLYVKG 
TTIMFLEVSVREHIiHDSQKKIF* *DAKL 
MNQKE *LD ILDS INIKSFVH* KAPLREN 
KGKLQSAIHIYEKGIjVSRIYEELLQXTY 
KETLKANKKKIDNPIEKWAKNLNRHFTR 
GCPSIYKHVK 


520 


14421 


A 


526 


288 


570 


ATGSLCCPGWS I VAIHGFNHSAL* LLTP 
GIjBCHPPASASQVAGTAGMSDCTQLCKIN 
FFVFVFIi/RRQSIiAVtQAEVQ 


521 


14422 


A 


527 


3 


438 


AVSHDCTTGLQPG*QREILYQRKICKGTP 
SPLPPPSPGIGGKPALPFWGPKTWPPPG 
IPPFLFSPPPPSQSGGGPRPLETLAGSF 
SQNPPmiiRDGAPPPPGGGFFG/CPPPGP 
PGGLG/PPGAPPPPIiKNFPKKKGGGGGF 
LFPPQAKG 


522 


14423 


A 


528 


2 


616 


FFPDTEWRSVAQSGVQWRDLGSLQAPPP 
GFTPFSCIiSLQSTWDYRRPPPRPANPFL 
YF* *RRGPTVLARMVS IS *PRHLPAriAS 
QSAGITDVSHRALQVCFITTLL* LSK/H 
QFBCKAGVTL/PHLQCLHEIGLDCVLHKH 
/WSVHHSPP/HKTNVC 


523 


14424 


A 


529 


57 


485 


RHS SLGNKSETQSQMKKKKREKKKGKE I 
SFDPArSLSI*PJCENKSLyQKD/TCICM 
FITA1jFIIAKTQNQPKCPSTDE*INKRV 

Myiyiyiyiyimkhyspikkneim/sf 
aatwmerieaiiissiwqkqkakyclpsc 

I 


524 


14425 


A 


530 


399 


3 


FFFFFFFFWENPPPPPK/WRGGGKKKAP 
FFKFFKYKK*]:iFFP*GGGGGHQQKKKKK 
KSSSFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F*FKGFLLGTTHASAHASAHAS 


525 


14426 


A 


531 


491 


4 


SKWI KDLJSrLRTK/ TTKFVEET * VKI FMN 
LGLGNDFLSMTPKA/LKIDKSDFMKIKN 
/FCSSKD/TIKEVKRQHTEWVKXFTSYI 
FDKGSTSRTCKEL/RKQQQNKPIQKCSK 
VLNRFFFREDIQLSNKLMKGCSTSLNH 


526 


14427 


A 


532 


1$4 


509 


pqwpahs flpalgs sgtgp y* wrq i fd 
skdkessq/wshetsdrpkpadhrrrsr 
pslatspxprlephpslpnhsgiipilss 
lpwgavatiapthfsalawp^rplpcnsq 
gekfffwfgffXerbshsatqagvlghd 

LGSIjKPPPWGSKGFFCPSYSGG/WNQKN 
HLSP 


527


14428 


A 


533 


401 


3 


VIREMQIKPLHIQ/ PGWIi* SKSVKKY*Q 
GCGQS/NVLIHY/WLKT*IMGPFWKIVW 
QFLIKLPYGTVILLSGIYILWRMEKKPP 
KFCTQIFIAAFFIIQ*PKSENNPNIHQP 
KNQ^DVHMYIHTMEYYLAIRMNIH 


528 


14429 


A 


534 


419 


93 


SIiAIRQM*IKIPPRYTRVAK\IEKSHNT 
KYWKRHGATGTLIHCWWSCKMA*S*KIW 
*FLIKSNINLPFNPVILL*GIYPREiyiKT 
CLYKD\ CM*MFMAVSFIHQNGKQPKCL 


529 


14430 


A 


535 


414 


2 


NPLARGY INCGPQ F FFFP PRKNLKWGVP 
LSNFPPKL*KGGIFGEGPKKVKKNFFFF 
FIiFFFFF^KKPPCSPRLKAK/WKIFGPL 
KPPPPGLKKFSFLTPKKNGDKRGGPPRR 
GNFFFFFFLVFLVKTEFHYVGQAGLK 


530 


14431 


A 


536 


32 


435 


DRATALQLG * KS ETS S YKKKKKKKKKKK 
DGGAPLKKNPGGAKKKPGEKKKNFSPKR 
GGKKNPPGNFEKKTNFGGGKNGAKPPQK 
KKT/AWGKKKNLKGKRGKKNPKTLGAKK 
FPPRG* KKKKKPPAARPGKASS 


531 


14432 


A 


537 


348 


1 


EYIKSTHQMGKNYPLKNTVSSGQVQWLS 
PVISAQFAIAKSWSQPKCPSLNEWIKKL 
WCV/WCVCVCVCVCVCMMEYHSAIKRNE 
LMAFAVT*MRLETI iisevtqewktkhr 
MFSL 


532 


14433 


A 


538 


523 


66 


TDQTSHNIPLSQSLIQSEVPTLFNSVKA 
E * GEE A/ AEHKFEASRGWPTRLKERSRL 
RNMS V\GEAAGSPEDPAT V/ INGGGRTQ 
PQIFSVAEATLNWKKTPCRPFIGREKSM 
PGFRASKDSLTLCRDFKTHLCLVPHYWN 
AKHVQVYILLKVTAKI 


533 


14434 


A 


539 


413 


2 


RDKV*PCCRSWSQTPELK*STRLCIjPKC 
*DYR/R*APAPGPFIiL*VHRHVSIFKSG 
PLSCRCSWFQD/HDSIKPSFPTIQYAHG 
NKFQL* TPDTLI FFLKWSFTPVAQTGVQ 
WHNLSSLQLLPPGFKRSDAWADAWADAW 


534 


14435 


A 


540 


385 


2 


mas khi kt/ carqlairevq 1 kttq yhv 
iptrmakvkkt/dnscwhgc^ntlincw 
wdckmmqldwkk/svwhssrgkmyvqlp 
ctpaipiilgiyteijktcshkn/t/ct*t 
fiaallvxaekwkXkcpsaeew 


535 


14436 


A 


541 


44 


398 


RPPFFFFPKKJRPIiWKKKQGFPPPA*RGE 
I PS AKK\ PPLGPVSGPGNPPGP * KP / PQ 
NPPPLGGSPFGKKPPLDIPFPGGPIKGK 
KWGGPPLAGGATTKN/PPPGGNFWKGGK 
APPFPSQKF 


536 


14437 


A 


542 


370 


10 


FFLRTKVSLFPRVEG\KVQSQIiLiAAPPS 
WGQVIPPLQSPE*LRQKDPFSPG6*GCS 
EPCSCPCLPAWMTEPNSVSKS*KKKRKK 
EKN* * KKKKSFINTTGKIRFRI IH 


537 


14438 


A 


543 


1 


370 


FLLRHILiLCHPGFCSVATAYCSRDPGSS 
D/PLPPQAPLPDQ*PRLQA/WHRIAPPH 
SANFF* *RQGFTVIiARiyiVSISQPCDPPH 
WASQGATATKADDYQK 


538 


14439 


A 


544 


50 


395 


IPGLTRQWLLDPCASPSTPPYT/ P*VQP 
STPQNSSPSPKTHNQKGLPMPLSPTPKP 
STAWKKAIIiEHTS S S SSSS SS S STCRNR 
NGYTYTVPEHRPARGHTASQTRKQVLAA 
THKP 


539 


14440 


A 


545 


1 


370 


LCSVTQAGVQRRKLNS VQP P P PRFKQVS 
CLSLLPFFFF* KRVFTLFPRLEGRGP 1 1 
FNGSPTLRG*GDPPV*ASKELRTKGGPH 
QGQLINIiF/CLGATGPTYGAQGGPKSPG 
LRRWAPLGPPRA 


540 


14441 


A 


546 


178 


365 


YKKTDATKTKMDK*DSIKLKSFSTVKET 
INKGNR\QPTEWEKIFASHTSDMGIISQ 

ICKELKQ 


541 


14442 


A 


547 


11 


236 


KGTTKLWKLSDFKS YFKAI VI KTVRN/ 
WHKYK\HIDQWNRIQNPEINPHIYGQMI 
FKKRAKNKQWKKDGLHKKR*WSNDFQET 
CQEQTMEKGRSSQEKMM 


542 


14443 


A 


548 


355 


3 


IKKKNLGRKRFCFFQEEKTWP*TPKKKP 
LENPGGVFPNPGFPPFFKAKIPEGPFPG 
V/SFF*RGGFKGYPPPLFFFFFFL*DGV 
SLCHPGCSAVAQSRLTASSASQVHAILL 
RHVDEGR 


543 


14444 


A 


549 


1 


373 


CFLiICRHTSHMWRIWFQTTAINQrWQ*K 
ESQKWSQ\LYI*KLHLLYEVQ*HFV*K 
YTQ*LLLEN"ADTK*AHATOKMVLIDIiLK 
AALSL I FNL * KNK *NLKNWPGMVAHAGN 
PSTLGGQGGWLTS 


544 


14445 


A 


550 


446 


2 


NIDKAPTVLGKMVSIFPTSRSPPLGPPK 
CWGHRREPPRPAYFGI * ILFLTPP *NPL 
NNSPQHLSGKISGIFSDPSLSVFFSSFF 
FAPPPKKFMIjCF*PFFFFFF*DRVSIiCH 
PGWGAVERSWLTAAPISQ\IR*SFHLSL 
LSSWDHRHA 


545 


14446 


A 


551 


342 


26 


WAPPIFFFPPYKRSPQKPFPPPGENNSR 
GF * IFG/RGGFFKKEGLSQFFFFFLKKV 
FFFSPGGEPQGYFPPPKASFLKRIFFPP 
PP IKKGDP6RGSPPRGNNNPFF 


546 


14447 


A 


552 


3 


167 


QPHLQDCL*QQNPVAVG*AFSSHPRDSL 
NNPW\WP6AVAHACNPSTLGGRGGRr I 


547 


14448 


A 


553 


3 


413 


TL ITALS SH* FFT* VGLE INMIiAP I PVL 



TKKINPRSTEAAIKYFLTQATASIILLI 
AI LFNN I LSGQ* T I TNTTNQ YS S L 1 1 IM 
AIAIKLGIAPFHF^VPEVTQGTPLTSGL 
IiLIiT*QKIi\APlSFIYQISPSLKKK 


548 


14449 


A 


554 


441 


1 


SSPPPQARGEIFF*KNPGEEISQHNKKN 
EVFPPPPP*KFPFSPKAFIF/RGGGGPK 
RPPPKKKFFPKKTPRVPYKPPQKKKKFP 
FPPPVKFGPPRIFFiCSPPPLFFFFFFFF 
FFFFFFFFFFFFFFRMGCDRWHGKPWIL 
RDGF 


549 


14450 


A 


555


3 


455 


NTNSSMYVCIYVFIIiRQSL/DSVTQAGV 
QWRDLSSLQPLPP\GSWDHRHriAQQ*YL 
FNVLLLEV\ FVYERGTIMVP IL* DCNVN 
CRVPGTRSLSVYCVMGIFYRC/HIESML 
CDRGFIFFDIGSHSVTQAGVHWCaq-IiGLX 
MKPLPPG*RNFSGLN 


550 


14451 


A 


556 


1 


299 


RRMRQENRWNPGGTGCSEPRSHHCTLAW 
ATEHDS VSKKKKKIRSSLGKKT* FYHNI 
LKAMG/ CITGIHKGPEGAWQREGHFTPP 
QGPKKAPQRGQTYPDP 


551 


14452 


A 


557 


3 


391 


PVWWNSFEASSGWLIGFKKRSCICNIKV 
QGETASASVEAGVS YPEDLAKI TDEDGY 
IKQQIFim)KTT/YY*KKISSRTFIVRE 
KSMSGFKASEDRLM* FLGANAASNF*LiK 
PILICHSKIPMTFKSRAKS 


552 


14453 


A 


558 


337 


1 


TRFPPFLGGFSTKFFYRPLVRFPLL/RE 
RKIFPLPPVFVGGFPPALGPFWGFLFFN 
FSKKRPFGPPWGFPF/ CFPKFIiPFJCPFL 
GFFPSFRRGFFFFFETVSLCQPS^SAW 
Q 


553 


14454 


A 


559 


117 


419 


I P PLLLGVGLFFFF F I RKKRGVFS PRWR 
GGGSNFGLLETPPLGIDPPSGLTPPKSW 
ELRAPPP PP I KF* KFFLKKNGF * WVS PG 
GLEISALLVFPA/SASQ 


554 


14455 


A 


560 


347 


1 


lYKELKQLCRKKSNYLIKKWAPCNPrTRHF 
SEEIQMANRQMKRCS\TTPVIIREMLrK 
TTLRYPAS PVKMAF I * DR* * *MF/ WQRC 
GEKGT/H/CWWECKSVQPL*RAVWRFLK 
KLKILLP 


555 


14456 


A 


561 


2 


375 


XPPPLQNGEKlFMII*TGABKAFYICrPF 
PFPIK/TLNKVGIKGNFPHIIRAL*EKP 
PAYIFHGEAESFPVRSGTRLCPLLLLLF 
HPVLEVLVRAI CPLK/E/ IKGTQIGKEE 


556 


14457 


A 


562 


391 


2 


ASGSKGKKKYPPFLRGSPPIiPGNPLFFL 
EGGEGKFP*P/RNWGPPPKrFPQKGQGN 
PPFFFFLKGPKGGGFFPPQGBKGVGFPI, 

PGEKSQRPF*GEKGGVPPP*RKKPPPFF 
FFFFFSETVLLLPKLECS 


557 


14458 


A 


563 


384 


3 


ISDSGVHPLGLPKRILLCQIiSYHVWP*? 
KCPC/PHLCLFMTLFQPP*GLPSVCAPS 
KQEEHGCFPISVARQEC*PLFPKNLDT* 
HPVGNFLKFFFFPF*DGVSLGHPGWSAI 
LTHCEJLCLPGSSNSLPQ 


558 


14459 


A 


564 


522 


120 


SLFLPTLBCNGTISAHYNLHLLGSSDCP 
ASASIiVAGITGMCNIiCLPSS IDPSTSAS 
*VAIDLRK* *GARGNCPVS/VLEKGWPE 
AEMGRGES ASWWDPRLGRPGVAKRI CRR 
HCCPVQQPQVSMEPSGPASPN 





559 


14460 


A 


565 


73 


264 


KDRHIDQWNRIESPQIKSMTIHLWSFDF 
FLQGY* *KDRHIDQWNRIESPQI\NL*L 
YI YGHLI FFYKGTB/ IQWRKKTIjSSKWF 
WDNWIFKCKQ 


560 


14461 


A 


566 


413 


1 


EKKKKVPSQVLTPKPQVLGNFPRFSEEQ 
* LSLVPVS SLPLQSLSGKNQGP * ARRVA 
LCFGKSPGSQQIWF/RLP/EDIVTTVQA 
SYSKKRLFLSLLDFQYVLQRREGAVNSA 
NLSLAPW 


561 


14462 


A 


567 


397 


1 


FLGQDLTVAWARVQWCSHGSIiKPRLPGIi 
KGSSRSATG/SASPYLVPMFLSNLFWHV 
YIL*LYKT]yiTGI*]yiEM*FTPSCPMYSW 
RDV* IFFSFETEACPVAQAGLQWRSLGIi 
PQPPPPGPKRFSCLSLLTSWD 


562 


14463 


A 


568 


437 


1 


KFSEDPAKLIGEDGYSKQQVLNAN/ETA 
LYWKNMLSSTFIiARKETSMPSKLQGTG* 
XiLLGANAAGDFQIiKSMLTCHFKNPRALK 
NYATSPLPVLHKWNNKAWMTVHLMTALL 
TEYFKPTIKTYTYHNTTGSLTTPHASAH 
ASAHAS 


563 


14464 


A 


569 


234 


1 


FFPPFPFKASSPPQGTSSSRGVF/PPFF 
PPPKKGPFPKIPPGSSSPPPF*EKTYFR 
FPPFFLAPPGVFFR 


564 


14465 


A 


570 


2 


396 


FS*AFYHFGrALTPQLGGDWSPA6rTQV 
KS liQ VP VVNT\ SGLLASGEE IT * ADHNV 
I QNNRNRRI QALL I TVLLGL YFTLLQ AS 
EWFKIPFTrSDGIYGSTFFECAGLHGLH 
VriGSALLTICFIPQLTFDC 


565 


14466 


A 


571 


3 


403 


HASGLPSSWDYRRPPPRPANFFVFIiVEM 
GFHHL'NKAXIKS FACNEIQPLSAVS VAG 
LVGCV*VCKCLFPVL*Q*LFQF*/S/HS 
I ANWMREWPLRIiSLF * LI C * GERMSGFA 
TQSRRDPCSIiPGFLYQVLSIiAKFR 


566 


14467 


A 


572 


100 


371 


YKSNDFYVYGILHFLTTI\FFFFFFKKK 
KFPPVFQVGGQGGNIiS*LNPLFLGIjKGF 
S CLKLPRS WNWRGAPPPPPNFGFFSKNG 
VSPCNPG 


567 


14468 


A 


573 


371 


1 


REGARES / TWSS SHTPVQKGLRYGI ILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTGITPLNPLEVPLLNTSVLIASGV 
SIT*AHHSLIEN 


568 


14469 


A 


574 


353 


103 


SL,QLQPPGIiN*SSHIiSLLSSWIYRHKFP 
CP ANVRFFVFFCRDGVLLVAQAGLRA/ S 
ASQSAGITGVSHLAQLVNPLLFYFSRAG 
D 


569 


14470 


A 


575 


1 


206 


FCIKKIIGFIPGK*GWFNISQPV*SLTN 
R I KEKIHISMTABP 1 1 S I ITBKVFDKIQ 
HPLILQKPFGQK*EYKFLNLIKGICEKN 
/PIPNI I YNGEHjKQGCLHiSPFIjFNIVI, 
EDIVIQSVKIKEGINIETQELR* SLTNR 
I KEKIHI SMTAE P I I S I ITEKVFDKI QH 
PLILQKPFGQK 


570 


14471 


A 


576 


157 


255 


YISPTPPFPPDLPF*NYLPNPKLTSPAP 
NFQIY 


571 


14472 


A 


577 


365 


3 


PPHPSVFFF*KSPPPPPPr*FFFPPPPFL 
FRGGPBYYSPP/SPPLFFSRRGKD'mPP 
PYSGLSVAGQTPPPPPRRTPPLVSPPPP 
PFFFF*RGGPKNKKAKKQTITFKSPPGT
NKPKHKNDRDI


572


14473 


A 


578 


2 


323 


VGSWEPDBKVSHRELLLSFLFYPFFSHI 
SS*ETLIDFALTSTDIWALWHDAENQTV 
VKYINFE /HVWVT * FCHPTS PQAISC *E 
DERPGAVSHACNPNTLRGRGRQIT 


573 


14474 


A 


579 


369 


1 


AHLKGTLGGFPDFPF INLE FRGGLTS CP 
NPLNNWE *R/RPPPPPGKIFGAFFFFFP 
PKVETGFHRVDLLVIilS * PCDLPASASQ 
SAGITGVSHRARPTPG*FKKKIMTF/ CLi 
*KWWSHyVAQIG 


574 


14475 


A 


580 


394 


125 


AAEGEQGREGWRWRPRGRERASDRERER 
VR/ERGEERERGEERAPQ * SERWR/E /R 
SRERERVRELWSDSDRALEREAItLR 


575 


14476 


A 


581 


416 


1 


KGQDL YGE V* KVLLKVIKQVLNKWKDMS 
CS \KINILI SFP IRIALKFFF/EIjDE I / 
CSFV* *NKC*RIAKEVriKKKQ* CLGGHL 
/TSPHIRRSYNTTFSQSIWCCYRTKQID 
* WTRTQNPEKDLRl *GDLI YDLRCCSSS 
RA 


576 


14477 


A 


582 


3 


403 


NCFSQFNVI IME I PAKPFID INKLILKC 
FCKGRS ILKKKR*EDS /QRRNPLVrKTV 
WYWQRECHIDQWDRINNPE INLHKYSKL 
IIjDKGAKAIR* TKES / 1 / YS K* C*NWKF 
CM*EK/ IDPKLN-PL YK 


577 


14478 


A 


583 


402 


121 


QS L IHS KALTLFNSMKAEEAAEGKVE AS 
RGWFMRF*ERNHLHNIKVQVEATSAGVE 
AAASS PQD *GKTTDE /GGYTTQQ I PNVD 
ETAFYWKTMPS 


578 


14479 


A 


584 


255 


517 


I Y I FGF I PRGRVLL / CTHP S * S WL.QS * 
LTAALNFWAQVS /LPSSWDYRCTPPCLA 
FFFFFCRKGGSTLCPRLVGNSNPLKFPP 
LGTPCF 


579 


14480 


A 


585 


1 


414 


SSKSCCS KA* SSMGPS PAF YKL VPLFLC 
LFIWDGVWLCHPGWSAWQSLLTAASTS 
Q \ VHPS S \ CLSLRNSW/EYRYVP 


580 


14481 


A 


586 


392 


43 


OTDYRCEPPCIiAX TETGSSYISQAGIjEL 
LASSNPPMVCHPKCWDYSHETMHSAQNF 
FLKGIS*F*LCCSHFIHNHLLWLGKVIH 
TYTHAHTGLEKYKTQCLDVKCIYSDLLD 
GAIK 


581 


14482 


A 


587 


1 


394 


GTR/YGINLFITSEVFFFAGIF*AFYHS 
SIAPTPQLGGHWPPTGITPIEPIiEGPLL 
NTSPLLASGVLIT*AHPSLIE3>mRNQGI 
QALLIT I VLGLYFTLLQASEDFEDPFTI 
CDGI YG* TFFWTGLHGLHV 


582 


14483 


A 


588 


190 


3 


DGINLKIPGGIFFFLKARSHSVTEAGMR 

W/P/lSrYGSLQPRPPGLK*SSHLSLLSSW 
DHRHAPPSC 


583 


14484 


A 


589 


157 


387 


TKKRGRVGGWWIKILGINLTKEIKDLY 
SENYKTSLKE I KER/DTNKWKNI PCP*T 

ERVNTVKMS ilpkai yrfnvip 


584 


14485 


A 


590 


891 


0 


ppppfffflpalifffppppq/pktpkk 
kknppppppp\pppfship*qflspppq 
hhhfihhpppqppfhpfyynpfkffff* 
ni]srvpppp*k*iyff*tkffffipniff 
f\tkkkiflspppkkfpplltnlslpil 
iknllskppppqi 


585 


14486 


A 


591 


1 


496 


GTRAYQIVKPTP*PIiTGAXiSALLMTSGL
PMRVHFHS ITLL I LRLLTNTLTI YQRWR
DVSRESTYQGHHTPPGQKGLPYGIMLFI 
TSKGFFFARFL*AFYH\SSLTPTPQLGG 
HWPPTGMTPLNTLKDPLLWTCVLLASG / 
VS IT*AHHSLIENNRWRI lEAIiLMTIV 


586 


14487 


A 


592 


175 


2 


PFFFFFLiLRHSLTLAQAGVQWCDLCSLQ 
PPSPGPKRFSRLSLP/H*PGMVANFCIF 
SK 


587 


14488 


A 


593 


107 


733 


AAAAASKVLiyi*REGQLPGAT \GTGGVQA 
*APGSVA/ABGASVEGPGFGDTAPAHQG 
LS PTRSHGQGGAGRAS / S SQQGSPGGRG 
DGASE VWSGAL / S PGGGKDGAS AS VPRG 
PYAEAEKGGWALRGtiGGVAAPGPPSRAG 
QAPSGS / YTGPNARPAPWP I PGQGGGLR 
RDQAG*VSSWTGSTEPGAHTAHRAPGH6 
GKQGS PQQPHPQGPGQ I PT 


588 


14489 


A 


594 


10 


435 


FKWLLKSHAICFWTRS*SYCDNVCVPSL 
WAHHLGIRTEIPEFFIiSKFLCTSIIPHF 
TYRRQLRIiIQGSTE*EA*EDKLEQK*AL 
GAAQFTLPGMDVFVCFVFCF/ CLFEMES 
HSVT * ARVQWCDL6SLQPLPLGFKQFS C 
LGL 


589 


14490 


A 


595 


437 


3 


DEPKKWKTIPCSWIERTNI / VLKMAILP 
KAIYRINA/VPIKLiPTSFFTEVGTFSQN 
* KTTILKFRWNQX KRA* I AKATQSRKNK 
ASSIT/PDVASNYKTTVTKTARHWYKNR 
HVDQWNTIENTEIKLHTYSQ* ILSKAGT 
SKQWGKEHV 


590 


14491 


A 


596 


2 


498 


FFPFLGKTKPPTHLFFFPPPPFPIFFFK 
IFFGPLFKKKNPPK/PFFKRPPSFPKNF 
FFFSPPPPFNPLFSKAPPPIFFFFFKKI 
FFFPFPPLFKKSPPKTPL*IIiGFFPSFP 
FFPPGFPFFSLF*GGVSLCHPSWNPWL 
SRPSSKIjASAFRMPPVEG/SPFPPSP 


591 


14492 


A 


597 


1 


311 


rrvssesrwrslesrgrsleiqgrmerp 
rcqrfrenscvaprhckgpggkglqalf 
*vpvaqlgepgaqlgdpgahgeatvpev 

QGE / PAALLPGTAK\ PGGEGS PGFI PPR 
HCRGTGEEGSPGFTQGRETSKAQQRTGC 
LLSCRGCLCLSVSGDS 


592 


14493 


A 


598 


302 


2 


FLRFFHHKKFTKALNRHFS KD I L 1 LLS S 
NHMKGCSMSIiVIRD/MIiLKIT/MKYQFI 
P TRMA/ 1 1 KKTDRYWQKN / CEKI * TF IH 
CW*\NEKMVQPSWKAVWQFLRK 


593 


14494 


A 


599 


3 


386 


HT WP P P PRS PTAP PARTQP PS LQ S ALP A 
PQPGKKKKALRNEKG*NGSKKG/RGQPR 
PPPLRGPNGRTRSPAGICKGGGGFCPGV 
SKAQGAPRPGGRETWPGFFGRRPPLPR 
GPGPPGVLWTSRHCPLR 


594 


14495 


A 


600 


374 


53 


EGFFFFFPRWVGGQRAIiFWSPHFPPPGV 
KIFFPPPPPGWGGLRGLPPPPFKfPFPFL 
KKKGPSPPCPVFSQLPPAGVPPPPPPP/ 
TGLDLRG*PPGPPPPPPFFFPPP 


595 


14496 


A 


601 


393 


1 


VSPPKGCVSENKIPPPHNYYFAKTFVHV 
KLFGVAPLFIiQII.F*GPFLTSIRGIitL\ 
W I PYS PGY\ TLTLL I APRHFLRT 1 1 PVF 
RSVLPKTYLGIiSGMPRRYSDYPDAYTT* 
NXGAFRIARESWGRKIjDr. 





596 


14497 


A 


602 


1 


155


FCFVT * SG VQWRNHDSLQPLS PGFK* FS 
/CSSWDCRHAPPHLASFCIFRSTRP 


597 


14498 


A 


603 


170 


455 


KPNASLKRVLILCIQLQSSRQQKNNPT/ 
VKKWAKDLNILFS TEDL * TVNRYIKKYL 
TSLVIKKYHFIPTRIjSKMKKTSHSKCWE 
SCGEIETIiTCSW 


598 


14499 


A 


604 


3 


449 


TLLSNLEEAKKKKEDALNETRES *TKLK 
ELPGVCNETMMALWEECKPCIiKQTCMKI 
YARVCISGSGLVGRQLEEFLNQSSPFYP 
WMNGDRIDSLIiElsrDRQQTHMIjDVMQDHF 
SRAFS I IDELF\QDRFFTREPQDTYHYL 
PFSLPHRRP 


599 


14500 


A 


605 


169 


435 


KYWTKINRR/VAPN*XSIjSPEKKKKKK 
KKKKKKKKKKKKKKKSASSSKASPSSSR 
GG 


600 


14501 


A 


606 


453 


29 


TSPPPPPGENFF*KKPP*K/HFFPPPQF 
RFFFPPFPLKIFFFPPSLFFFWGVFPHF 
PPPPKKVFFPKSPRGFFFPPPLKKKFFF 
SPPLPFFPPPGFFLSPPPFFFFFFFFFF 
FPFSPFFFFFFFFTVKNFYLLCYFYKN 


601 


14502 


A 


607 


170 


408 


NEYDHFSIIKERRDFIVCVFFPIFSR*S 
PTWTQAGLSSIiQPPLPGYKRFSCLSFL 
CSWD Y /RWHPQGS ANFF VFLVEMG 


602 


14503 


A 


608 


2 


448 


SLiHPVI YSEGIKSRQSPCWR* * KWKKK 
KKKNPNQKPKKKKN/RPKNPLGGGKKFF 
*PEKKPGPGKKILKEKGKKICPFPPQKK 
NFKILKKKKKGAPLKKNP*GAQNFPGMK 
KINFFP^REVKKNPIGIPKKKPLFWGGP 
IGANPPPENL 


603 


14504 


A 


609 


441 


28 


GGTFLKGIKSAPKI *KGFPGWGGARFPA 
RDPPQFGG/PKGGGSLSPRGLNPPGPPG 
* TLFFFKKPKLTGGGGPPPLI PALRGVR 
P*NSLYPQRGGGAQIPPGPPPPGGKRAP 
CPQKKKKPRRKEKKK 


604 


14505 


A 


610 


445 


1 


LWLKKNTGGGGGSPPLFPPLGEPKRD/G 
FPRGRGFGPPPPPIKNPPFFLKWQNNPG 
GGAQPGIPGPWGG*GGEFPLPP\SPGFH 
*PRFGPFPPPRGTKREPPFQKKKKIANC 
FLLSDKSLLLEEAWGQWGPSPLEPTVA 
PKPNPRGKAQ 


605


14506 


A 


611 


253 


437 


KKKKVQDMFSENFKML/NE I KATLNKWK 
D I SYS * VRRIiCWKMAXLPT/ IQS YRLN 
IIPIKI 


606 


14507 


A 


612 


186 


2 


KKKKVQDMFSENFKML/NEIKENLNKWK 
DISYS *VRRLCWKMAIL\P\KQS YRLN 
IIPIKIQA 


607 


14508 


A 


613 


37 


453 


KTPPPGENFFLKKPQKKNFFPPGNWGVF 
FPPSPLKFFPFP*TFIFFGGVWPWFPPP 

kk:kfffknspgvff/sppp*kkkiffsp 

PRYFWPP/HGFFFKGPPP 


608 


14509 


A 


614 


1234 


0 


KKPPNTPU* * WAKDIiNRHPIKDSS IKDD 

qyy*slgqcklisqynftctriaiikkt 
]srnltksw*gcgetgnlih/wieckmvq 


609 


14510 


A 


615 


327 


188 


FFSSSLLI ISIPKFiraVTSLLRNLK*AM 
LGSSHL*LQLLRRIiRREDHLSPGVSGCS 
KL * SH\THCTPVWVTE*DGRKGIiAGDGS 
HSVTQTGVQ 


610 


14511 


A 


616 


103 


446


DGGGALKKNPGGAQKNRGEKKKFFFF*G
G\KKKTPGGFWKKNPFLGGGNPAPPPPK 
KKKPLEKKKNF*GGRGBKPSPLPCGKEK 
FSHKKK 


611 


14512 


A 


617 


3 


429 


glls 1 1 ykellqinki snlvgkwtkdk/ 
mkqflkkeihrlakVympcrystslvtie 
m*lktryyfhplnk1khdnnihc**gyk 
evgiiihillea* igkpfwkaitvtvlna 
pvfcl* fcfemefhsccpslkcraptsf 

IGAS 


612 


14513 


A 


618 


427 


6 


WGGPPPPPPIFFFFPPPPXPYFPFFFLQ 
ALFS*RVFFFriTPPPPKNFPPP*GPPPP 
PPPPPPPCV\FFFPPPPPPPPPPPFFFP 
PPPPPFFFFFPFFFFFFFFFFFFPFFFF 
PFFFFFFLVENCVFINS I TSDIEQKIS I 
SC 


613 


14514 


A 


619 


68 


434 


YSKD* PINTKKKKKKKKKKKKRGGAFKK 
KPGGGKNKGGKKKKNFF*KGGGKKNPRG 
NFGKKTLFGGGKKGEKPPQKKKSLEGKK 
KI *RGKGGKKT*NP/CGGKKFS FGGFF* 
KKFPPGGGGKY 


614 


14515 


A 


620 


454 


90 


NFPTPEKFGPPQGNIi*KAPPFFFF*KGI 
PFFFPG*KQRGGFiCSPQKPPPQGKTIFR 
PHPPKKGGPQGPPPPGGKIFFFLEF\KK 
JCGGFPG*PRRFFFPGPGKPPPRPPKKAG 
IQGETPRPGP 


615 


14516 


A 


621 


432 


2 


PEAFLSSLDHPAPGEKFFLKKTPEEKFL 
TAKKYRVFLPPPPLKIFFFP/LRALIFL 
GRFAQ I FPPQKKGFFPKIPRGVPPCPP * 
KKKNFFFLjSG*IFAPPGIFFKGAPPFFF 
FFFFFIiDRVWLCYPGWSAVARSRTS 


616 


14517 


A 


622 


422 


2 


GGGPPIiPPPRGGLPPPK**KA/VGGGKK 
PPPPAPL*NPPPPKKKIGGGGGGKMPPP 
PPLACFFFGFFLPPPPPSPWGGEKFFFP 

FLSGGP p p pqkkkkkkkkkkkkkkkkkk 

KNKKKNQTKKKKKKKRAAARDPRVRPRV 


617 


14518 


A 


623 


176 


401 


KFSMFI ILWKAYYTLCVCVCVCVCVCVC 
VCVIMYIiFISSQ/RRLCFLGEGKICSIS 
LTVLRRGSRVCV* SQDLPVC 


618 


14519 


A 


624 


1 


355 


HSSGLDNLTALAHSHLCCGV/ CLCFLVL 
VCVCVCVCVCVCLCVCVCL/CLYPGPQV 
PK*SKNSSTSCLLFHFTSARTICP/CC/ 
CVVFCFCFFA/CYQTCL*ILTSCL/AIC 
TASGVCL*ANKYSiyi 


619 


14520 


A 


625 


2 


400 


HTRLIFCRDGVLLCCPGWLQTPEIiKSLL 
FSFGFLSLFFFFIiERDPGSHGP \ VGGGG 
PNLG*WKFWPLG*KQFSFLTLWRRGNT\ 
RGPPPPPPGIjVFWF*KKTOPCFVAQAGL 
EQRPFGDQPWASQGAGITGVP 


620 


14521 


A 


626 


307 


3 


VFFRLPLTQGPGLFACFCPPPPCRFFFS 
FFPGSNSPPFP/L*KtiNPWAFEPPPFFF 
FFFPFS ILPKDRVLLCHPGWNAW* S * * 
P * TPGLN* SSHLTLPSGRV 


621 


14522 


A 


627 


1 


391 


NPPPLGGAKEGGPPRARGSNPPYPYWET 
PFFF*NPKIFPGGGGHPVIPSFPGG*GR 
KFPLPREGGVPL/RPNFPP 


622 


14523 


A 


628 


191 


375 


LILIFKTIPNPLILNPNP*SGGATQAGG 
QGRNLG*LQPPPPWLKRFSCIiNLLSSW\
DYRGWP


623 


14524 


A 


629 


3 


382 


HHCQIRLFILLITASEVLFCFVLFFFET 
GPCSVTEAGVQ*CN/HLQPLSPG/PPTS 
AA* VAWTADVHHHAWliFVCIiLI *DGSV?S 
VTQAGVQ * RDPSSVQPPTPGIiK* ASC\ P 
AFQKCLDYR 


624 


14525 


A 


630 


396 


2 


CRIEJSrVLKKKKLEMSNLSEEGMLKANIG 
QKLiSLLAPQQWNAKESLLKEIRSALPL 
NTQMIRMQNSLSADMEDIL/WWIR/DP 
TNFNlPLSQSIIQSNA/L/TIiFNFMKP* 
RGEEAA*EKFDAISG* FTTFKERS 


625 


14526 


A 


631 


47 


392 


LHSFFFLFIiGKKIFTPGGGGGGQKPFF* 
NPPPPG*RNPPFLPP*KTWPMGPAPPPR 
KIFGFLiCKKGFPLGGNL/SIGTPPPRGT 
PPPLPPRGGCYGGGPPPPPFIFFFRTFK 
NFSN 


626 


14527 


A 


632 


39 


382 


LFFSPFFFKNFFFFFGRFFFWGGVAPIF 
PPPKKIFFSQFP/LQVFPFSPP*KKKFF 
FPPP*IFAPPKTFFSIPPPPFFFFFPFP 
FPFFFF 


627 


14528 


A 


633 


2 


213 


LDKPGKHSKISSLQKI/HNGVCl4QS*LL 
GWIjRQEDCrjSEGS*GCSEL*FYHCTPAW 
ATK*DPVSKKKKGFIiK 


628 


14529 


A 


634 


2 


206 


QENGMNPGGRACS * PRLRYCTSSWATER 
DSVSKKNE/TNKTTLLREI *HFVGGPNG 
KKGLLKTVKGGLT 


629 


14530 


A 


635 


205 


2 


KRSLGLLAQI* /VQWGDFKTLQPLPPGV 
KQ I SRLNLLKKWD Y * RGPSGLGKFW/ 1 F 
L*KQGFPQFFRWFN 


630 


14531 


A 


636 


3 


399 


Q VQQT P AS CP LD S DLS EDE DLQXAMASM 
TFPKKKKKKKKKKKKKRGGAP*KKTRGG 
PKK/IRGKKK 


631 


14532 


A 


637 


122 


373 


VSNILWTQSLLFFFFFLKGSSLFVPQPG 
GQGLDLG^TKFPPRGLKEFSCLTLRISG 
NYGLAPP/HPG*FCFFIKKGVFPCCPGW 
F 


632 


14533 


A 


638 


390 


3 


LLVLFLPQDGWSPFAFPHEQKLPEVSPK 
ANAAMLPVQPAKP^AHPTFFFY* FPSFR 
FFFARMG*SS\YFVPNLTKLSEILKGPI 
NDMRE VFFP I FRLL I LFFFGDRVLLCGP 
GWS S WQ * L * L I AASA 


633 


14534 


A 


639 


392 


42 


PSGPGKPGGKTFLRKPPFQEACKRQGFY 
PLFPPKPLKNPKAPQNRKNLGPICPPPK 
P /QGPLRISPQNSQSGSL*GPNKGNCFP 

A*TWGPPGAP*RGPPKGPPFFFPEKVME 

AMGLA 


634 


14535 


A 


640 


390 


1 


IiS FSEEGMSTAEAGQKLGPLCQ/ TSQVI 
NAKETFLKEIKSVIPVNIQMIRK*NSFI 
ADKEKILW*I*DQTSPN/IPL/SQCXjI 
QSNVLTLFSSKECAERGEEAAEEKFINFS 
*RLRKEAASADREAAESyPEDL 


635 


14536 


A 


641 


368 


3 


KKGGYGVPPPFSPPP6PLGEGSFLPPKI 
KVQKTL/SWPPPPPPGGKKKTPFFKKKK 
RKKrFIiTGFFFFP*KKGPFFFFF*CPW* 
KFWGGGFKPIFFFFFF/RDRIiSLCCPGW 
SAVV*SQFTPRV 


636 


14537 


A 


642 


134 


413 


QKDQRNR lESLE/ IR/ PHTHNTL I PDKG 
VTGFQyGKDSLPNKWCWBN*RE/lTQNIi
SLDPAHTTTNVNLKWTRCLKARP/VKTL.

VENMGENLRD 


637 


14538 


A 


643 


2 


436 


GRVES INLPLGCRLFKKRDNIKCGQ \G* 
RAGGSL I HCW * E C/ E WQPLWKRIQQFL 
IKLWIHLPRDSTILLLDIDLDGHPKTSA 
LFIITH1TCCPIHNHP*LSQPKCPSTDVW 
MHKPWPIHSMEHYSAIKSAAVWDNFKGI 
LL 


638 


14539 


A 


644 


446 


75 


LDLLTS *SACLGLPKCWDYRSEPPRPAF 
NF* /STTK*SLWSSHKQDYICRFIjIjSYT 
ATQKFSTLiTTY*SHLiCrF*NAQMPRLyL 
QSLILGS I F*KSNLPR* FQCAAKAENH* 
*SDLyCQIMLFSFM 


639 


14540 


A 


645 


318 


1 


KGVPRKCKNSYVGPPSGPPIK* ILFP* I 
F*DRISIjCSPGWNARVFS*LPV\T*TTG 
VKKWWPLSriLN3:*GYKGVPPRPGIFFFF 
FFFRDR/SLTMLPRtiVTWVQVI FQ 


640 


14541 


A 


646 


3 


348 


QFSSFFVLIYFTLPFMVSFCF/WCFSNL 
VLFCEI IFPLIPY\ YYSLVLSFFYCLIiV 
LFLSKLSFF/C*YLNITVFICSMTSFF* 
RIL/ CCLNPIKTSFNFLL* * FGMVLESC 
FFFFFPWP 


641 


14542 


A 


647 


2 


553 


AILIPDKIDLKKWTGDKE^NVTMIKGS 
IHQEDVTT INI YASNSR/APKYMKQKIiK 
GKEKQ ISP* K/ WGI FNTPFS I /DRAPR 
QKINQEIEDLNNI IRQIDLKDI YRTFHP 
TSEYIPFSSAYET/FSKIGHKLKHKTSL 
NKYK** TE I MQSMPS D /HHGMKLENNE /K 
VGKLTNMWKSDTLVSDWLGAVAHA 


642 


14543 


A 


648


388 


1 


GPPR*SPPFGKPRGAVPQ/GGGGLKPPG 
PQGENPFFFKKPKITLGPGWGP /RNPPP 
LGG*KGKIP*/PPGGGGSNKPNFSPSGP 
PGKQKQNFFSQKKKKKKEKKKEKPES*C 
PGTS PQP VTDGSQA INAS I S S 


643 


14544 


A 


649 


387 


25 


PGMRGGFPPFPLKNPFFPPGP* /MSGGG 
GGPNGPPPKKGFFPKNPPGVFFSPPKKK 
KIFFSPPPENLGPPRDFIiKGPPLFFFFL 
QKNPGVFFRAGQGNPPQKLAKIFPLPPG 
GGGGAVRHI 


644 


14545 


A 


650 


918 


2 


LGIiKGLTIYKILHSTIADHTFFSSSHGT 
FAMTDHILGHKIHLSTPSKE *EI I PSTS 
FQ /HHSRNLN* K* INNKVNWKI P K\ FWR 
LNKTLLNNT* TKE /GLKRHKNIIiS * TKN 
ITYQILWDAGKAVLRGKFIVLTT* IRKE 
ERSK\TTS FNIRKRNKKL I INIRTE INE 
I ENRKS EKI LN\TKS WFFEKKKS I KS IK 
TLARLNKRKRKETQIPKHQK*RRGITTG . 
NMANKKI KRNIHEQPYTHKLGWLDNIGQ 
FliEKRYLPKLKQGE I * SGWAY IN/SKEM 
ESI INTLPKRKAQDLKMFSQSE * YQTFK 
EKK 


645 


14546 


A 


651 


282 


265 


GVPFLKKRPTVF* KKKI * LGPHPQLKGP 
PGVFQ/HFPI *NFGIS6RGDLPPPPFFF 
FPPFPFIiPIiRQGQVSVAQTGAQWmiTQGS 
LQPPGLK*FS 


646 


14547 


A 


652 


6 


223 


LYAHKFNNLDSMDQFLQRHNTPKLIQEE 
/HRLTVIK*IKSIVI1TLPKQKA*GPDGF 
SGEMYQLLKESIIYNVF


647 


14548 


A 


653 


128 


337 


FXFFFXXFFFXFFXXXFFFXXFFXFXFF 
FXXXFFFFFFFXXFFFFFFFFFXFFXLI 
FFXFFLFFFFFLFF 


648 


14549 


A 


654 


238 


2 


MAAASTTLLALVI ISTTF*LPQVNGYIE 
KS TPEE CGFDP \ S PARVP FS I KFIiLVGS 
PIQTHALDIALFLSHTRALPHTHT 


649 


14550 


A 


655 


325 


3 


CSEHKKNVP*LRAKKTNNPNLKWAKDIiN 
RHFSRENIQVAEKHI^ICSASLGIjREMQ 
IQTTKC/WHPVWKTV/WTILQN/LHTRIi 
PYD/PAIIiLLGIYQREIiKTCSHKNMYTS 
V 


650 


14551 


A 


656 


3 


1655 


FFLGMESPSVAQAGVQRCDLCSLQALPP 
EFK* FSWIiTLLS WDYRRLPPCLANFVPL 
VETGF\TMLSRMVLIS * PCDPPTSASQS 
GGITGVSHRVQPAIiLFFLSCIiCVKMIVH 
APLVKTCRPSTVGSCL/SVPAFWE 


651 


14552 


A 


657 


519 


84 


SRVRCQSAXQMGGASHLGYSGVRDPLEE 
AVFPFSDLKLHAGITTTLFKAVRQGHLS 
GFAKDF* RPAEQ /GT* YTS KLA/STKQM 
ILHEE * I TFVLLLPKHEGIPJMPSVPPKP 
FliFFS *GMTNSGSAGILDFVPNHPSKDG 
*HyVLrTSH 


652 


14553 


A 


658 


38 


383 


EKCI FFFFFLERGFTFGGPPGGRAQFWF 
NKTPPP/HKKREFPALPPPGGGNNGLGP 
PARENIiVF* KKRGFPL.GGKRGLNPRPQG 

NPPP*PPKGG/GNlvrGGGPPPRQKK 


653 


14554 


A 


659 


98 


394 


KPPGKKRETPSQ/NKKKKKKKKKKKPPF 
PPRGGKKNPFKKKRGG*KNPPPRGGGKG 
ISrFPPVMG*RGKNLPPPRKPLPPPPQGGG 
KN*KK/RA*KNSPQNFF 


654 


14555 


A 


660 


214 


567 


GFS VIN IGLKMMMM 1 1 1 1 LRDS LTVTQ I 
GVQWHNLSSLQPPPPGFK*FSCLHLLSS 
RDY*HAPPCPVNFCILS/ERQFHHVGQA 
GLKLSASSDPPTLxAPQRAGITVPSHQ 


655 


14556 


A 


661 


463 


392 


S*HFHPSSSSSSSSSSVINYY* /FGLLL 
C/QVAHFYNS IDQQMIQSQRPMMLQSAL 
AFEQ 1 1 KVWGLL I IiL 


656 


14557 


A 


662 


205 


242 


GTGWQGGGLDG* QMTLGWPGSGVGGSAA 
P*GI/GPGLPGLPAPPAPSP 


657 


14558 


A 


663 


63 


351 


QSETPSQFKKKTQIELLYEPAVSLLGIY 
QKERKSVLKKHYTPMFTAALFTI\VWDQ 
PR/CPKYMKR*N\nviY/HVYTHTHTMEYY 
S ATE KNE I LF \ FATT W 


658 


14559 


A 


664 


1 


358 


FFLRQGL/ DSAAQAGVQWCNHS SQRAPP 
LGLGRSSH\PASLAGWSAVA*SRLAATS 
TFRVQVILLSQPPE *LGLQSCS ITQVGV 
QQWRDLGSLQPPLPGFKRLSCLSIjPSN* 

DYKRLAPPP 


659 


14560 


A 


665 


10 


257 


GMNERGNITKEIKKI I WEYYE *LGTSQIi 
DTL/ELNKPPERYKL/PRTNSQS ISNMN 
RTITSDYISNFKRLPKKSPGLDGPTGEP 
I 


660 


14561 


A 


666 


360 


2 


ENKKI FNSAPPPGS PFIjFPPLWE IRPEG 
FLKPRVIiNPPWPPN*APPFKPPMGPPFF 
PLFRGVRVGDPLSPPGLRLP* PGLPP\G 
PPG*PW6QNPTFFSKKKKKSWLDSSGFL 
EPNISVNF 


661 


14562 


A 


667 


328 


1 


nlpaenyecpenkkknfqkkkihqiwqn
SEFMKVTEYKLNYQN/SQLYLYMPEKNT
* KV/ ILEKHPL * PQ ITKYFGE ILKKDVL 
NLYTDKYKISLREVKGEQNKWNlSrVSS 


662 


14563 


A 


668 


405 


566 


IHCQWECKTEQILWKTVWWFFTKLNILL 
PGDSAIML.LGIYPKELKMYIHTETCT 


663 


14564 


A 


669 


218 


2 


LLMLESLMFVPPPPSPEKWAKL*IY/CH 
GAHANFLPSFFPSPPPFSLSFFFFYNRV 
FLCCPGWSAWRSQLT 


664 


14565 


A 


670 


2 


210 


NFGQEENSEMNSX»CSyiiHNIjEKGEQTRP 
KASRRKEI IKI *AE I *K\VQRSNRENK* 
KKWFFEKINKIDKPIiARLTKKWRT* ITI 
VRKETGTITKDPADTKRIMKEY/YKLLY 
MHSF*NLSRNLKRYREVIEKINERSGSL 
KRSIKLTNL 


665 


14566 


A 


671 


200 


3 


SLCHLPHVASKATLEr\GLVEHMW*DFF 
CFFETESHCIAQSGVQWCNIiGSIjQPIiPP 
RFKLFSCIiSL 


666 


14567 


A 


672 


94 


374 


PKFRPQETTEQTSQPLQQINCKGRKRQG 
KRTYRLGEIQKQSQPMATFELYLD/H* F 
*]:iAKETL*LGAVAHTCNPSTLGGQAGWI 
TRSGVQDQPGQ 


667 


14568 


A 


673 


312 


3 


WKMGQIiPGRPQLPQLSQEKILSUSrSPlMV 
FKKKKLK* * *NILPKKKRPDCFTHKFYQ 
TFKEE I IPF/L /HKLT*EFEKKE 1 LLKS 
F\YEGSW*AAKLNADTNKKTTH 


668 


14569 


A 


674 


34 


438 


QLT*PD*HFIKYSTQQQKTFFSSTHRVF 

aklghnlvyXkanlnkfkwlq vr snmfl 

DRDRIALKIISJNKK/PSSPLKYLQTHLLIi 
NDPRI KEGSKRE 1 1 KDFALNDNATY*NL 
WCL*NGTLRKVYTTKCLFRKEGPQiyi 


669 


14570 


A 


675 


3 


349 


qkdrksvrpss*nppiiavekipi/lkqn 
perfngpaffkiahgltsslvfclansn 
yerthsriiilsqglqtgfaiktprkll 
aslanlaiipptinllgels i 


670 


14571 


A 


676 


346 


1 


kmvkll*nivwkllk\gl.tevphgsaif 
pffipifpkemetnvhiktctwafpaaf 

FPiyiAKKKKQSGMSrPTTDE*IQKK\W*TH 

I idyysaikrwellthatscnitlskrs 

QTKK 


671


14572 


A 


677 


357 


6 


AAGDS ELKPMF I YCFQKPMPL * I MLKS T 
LLVLYKWNHKAWMTACLFTAWFTE /HCK 
PKLETYFSEKRFLSNIT^LMKASGHPRA 
IiTEMYKE INWFKPSNTTCTLHPMDQE V 
ISTFNSY 


672 


14573 


A 


678 


401 


155 


YYS VAQAWQWRDLS S LQAP PPGFE*FS 
C\PASWDYRHKLLTSGDIiPASASQSAGI 
TGVSHCAQPLSLFVPFLTNQRFVAALS 


673 


14574 


A 


679 


292 


395 


Q * KI S FEMIilil ITNVPGHPRTPMEMYKB 
lnffXmpantsip* PMDQGIVLTFKSYY 
A/RWVPCKAIAV/DSDSPDGPGQSKLET 
FWKGFTI *DVIKNIDDL/WRGVK1PILT 
GVWEKLIPTLINNFEVFKASLEEI 


674 


14575 


A 


680 


275 


3 


WPRQASIiAIiNT*SIHIYIHTDMRTHAHT 
Y I FGMKS VTQAGVQWHNHGS LE P * S PGL 
RRSSHPSLPTSGEHRRTPPR/LADFLHC 
L*R*GFTML 


675 


14576 


A 


681 


330 


507 


GS V* PVNIilRNCQPFVQSGCA/ SLHSHQ 
E KWYMGSPHPRHYLVASVIFIIAILFFF
liRRSL


676 


14577 


A 


682 


390 


43 


HHLP I R YI CIiS I SS ILCLS F IMYLS / 1 Y 
LS I YLS I YLS I YLCQRCQI PLVFLCLIiL 
S*LS * I IIiILNFSCFFVGRTlVLS 


677


14578 


A 


683 


396 


63 


ADPTE IQRI I SD YYCIQRHTNKLENL* KM 
DKFLKTYNLPRLN* EEVETLIRFIAISTHE 
lALI IKSLPIKRSPVLGGFSVGF*HIDK 
EELTPVLGLFQK\ rKKEEILANSPYESR 


678


14579 


A 


684 


1 


396 


EETLPIiFADDMIIi YI IKKKQKTKKTTKE 
STKRY*N* *M\WVFGKVAGYKVNTQKSI 
N* QYT I * KVKEiAS \ FSS S PQKNKIGINL 
TKEIQJJVYSENYKTLKEIKDLNK^ESIP 
CSQIRRFNIVKMTVIiKLIYR 


679 


14580 


A 


685 


283 


3 


NCVESSNTHLWEPSQKPLSSVSLTDVCP 
PMLSAALFAIA\RSYTLPTRSSIDE* IK 
KMWYIHTMEYYSAFKKKRIMS FLMPRME 
LKIFRVNKIR 


680 


14581 


A 


686 


53 


324 


HCPGDRIRAAFCFYILDCPFFISITHKL 
I *I FFPFFLERES IPVPGVGGGGPNFGS 
liNPLPPRIiRRFSCIiTLPRGGDYGLGPPC 
PT/NFCVF 


681 


14582 


A 


687 


39 


208 


NIFFCREGFAMLARLVSNS*lSrWPGKVTH 
IHNP/STLGGIiGGRITQTQEFETSLVDA 
GA 


682 


14583 


A 


688 


47 


341 


S SGRVFVLFCF FETGSHS / VLFRLAYSG 
GIS A YCS CHLLS SGDSP I SATR/GTTGM 
CCS AQLGFCVCVCVCF/ I FVF* VKIGFC 
HVAQAGIiELLDSSNPPTS 


683 


14584 


A 


689 


229 


1 


GRVDGRQIACQEF3CARLASQSAGITGVS 
HRAQP FFCFC/LLF WFEMGSHSVTQAG 
VQWCAPGSLQPLPLRFK*FSC 


684 


14585 


A 


690 


339 


1 


KKGPPGGPTKPEGGRFPRGPPGGGKEVF 
PPPPPGGKGEKKRGGAPPPFFFLKKKRK 
SGKGG\N* FFPPGGGKKGGQKKKKRVFF 
PKKKKKKKKKALSLSLSLSLSLSLSLLG 


685 


14586 


A 


691 


372 


0 


YDYLYIKKL*N/IjREKIDKFLDTYNLSR 

lnqeeienlntpitsmkietvikspptX 
kqkspgpe 


686 


14587 


A 


692 


117 


297 


APLLESSSASYLHFSN*DLQMAH/KHTK 
RCSTSLAFREM* IKTTRRHHFTPTKMAT 
SKRHT 


687 


14588 


A 


693 


122 


322 


EQTNS WS IDFFFQRCHSNAMGEKNCLFN 

KW*WDNYVAI\AKKMKLSLFKPYIKTNS 

K/WIKDLNIRAKT 


688 


14589 


A 


694 


21 


342


RSHFRHLNEYYATQQGNSDVHPLLIHSN 
ISTAIIWQIiQRQERRKEKRLKEVKNEKA 

iikei ffiirgwvllchlgws pwgs * fi v 
tlnXfglrqsshlslpsswdyrpv 


689 


14590 


A 


695 


2 


343 


KFVPLHSGLGNRSQKQTLSQRKKKKGGV 
FVAWAKVQWHNHG*QGP*TGLRQSSCLS 
L * GGPfVKTi/PGGRHHAWGFHHVKKVGLE 
liLTSDDAPSSASQKAGITGMSHTAPGPK 
NLV 


690 


14591 


A 


696 


336 


3 


xppvkspkfpillkkkkfslfcfsgapf 
pcysppraqkkgw^kifspppkXqkkpg 
alffspkpaffpppfkkkkkkkkakegh 
ylimikgliqqeki ilniyasnigarrrg 


691 


14592 


A 


697 


374 


2 


AFY*KKMVSRTFIAREKSMPRFKASKDR
LTFLLGAGNFKLKPVLVYHSENTRALKN
YAKSTLSVLYQRTTKAWMTSHLFTEWFT 
EFFEP\TVETYCSEKKIPSKII)LLTDYA 
PRHPRTLMBMYSC 


692 


14593 


A 


698 


361 


3 


PQMCPPPPR*QGIIFIj*L*THCFFCTVK 
ARLFIiFPPF/S *RKGGFFFFE *RKFFKY 
FGWELFQIRFKVFFFCDRVSLCCSGWSA 
GART*LTAASTSPAQASLLPLTLPSSWD 
YSHVLPRIiV 


693 


14594 


A 


699 


75 


362 


KDCAYGICSKKQKGFCIiTQTGAWQQYEE 
I*GLET/GFSRSVHSMGQRRYWBYRHAL 
THPANFVFLVETRFLHVRQAGIELPP/ S 
SSQSGGITGVSQ 


694 


14595 


A 


700 


2 


277 


FFIiYTSNEQSEKKI /RETNWFI ITSK* * 
NRNVNKE I * DL YNENS KTFLKE I KEDLN 
KWKGILSS * TEKLNI IK/PTAI * * FNAI 
LIK/IPMVFDK 


695 


14596 


A 


701 


1 


407 


GTRKHFSKEDRQYHKSLRKQIKT/TVSY 
CFTPTKLTTVGKTANSK/CWAGCIOriGT 
LIHCQ*ECEMVWLLWT/SVQQFL/RNVK 
L YETAI S LLGICTRE I KTPLHI 1 1 \ I lA 
KKWQQLRCIiSTDBWINNM*YIHTMEYYS 
A 


696 


14597 


A 


702 


209 


3 


SLNRRLSFFPY*RQGLALLI/EPGWGAV 
ARSWLTPASN\FELQRSSCLSLRSSWDY 
RRVPLHRVS LRFS C 


697 


14598 


A 


703 


308 


3 


TTGLKRPARVGPPKCWDYRCEPPGLATS 
SVLKAFQPrGSVPPRIjSRlLSFM*N"Q\* 
YRML I TS TKY FPRMVS I SRP CDLPAIiAS 
QSAGITGMSHHAELVPLV 


698 


14599 


A 


704 


603 


2 


PLPP\GLQVSSP/CVSLPSSWDYRHAPP 
RPANFS VF * * RRGFTMLAKM/ VL I S * PC 
DPLASASQSAGISGVSHHAWPKQT*LLD 
TDKTEG I FLTNHLLR I P P PMF YQ YI L I K 
VLESVKETKDKRKSVSSCTLYRCIi* ILF 
IS *AYIKFTTFFFF\ETDSRSATQAGVQ 
WHGLSSLQPPPPGFK* FSCLSLLSSWDY 
RSMPPRPA 


699 


14600 


A 


705 


3 


366 


ARVTYYSGKKEPFGYLGMA*AMISIGFL 
GVIV*AHHIFTVGIDADTRAYFTSATII 
MAI PTGVKI FS *LATLHGSNMK* SAAVL 
*AI.RFIFLFTVGGIiTGIC]:jSNSSI>DIVL 
H/DTYYVGAHF 


700 


14601 


A 


706 


358 


3 


GFHHVGQAGLKLLTS * SARLRLPKCWDY 
KRE PPHLAPDRKYFS PLVFARDNF VPQG 
P* \HVWINICLSQVKAATGIYPGEAGAL 

RNRLHPH * R I F WPQRPR VP I KS P WTLRV 
ISTLCSC 


701 


14602 


A 


707 


358 


67 


QLSSTIWY* *KN/RIDQWNKIAQNTSVH 
RKXiISDKVTKPIQWSKDSIiFNKECWNN* 
TSI CKK/LSLDSALTLFTKTNSK* ITEP 
GSVAHTCNPTTLGG 


702 


14603 


A 


708 


34 


363 


RTVFFFFFFFFFFFPKKTPFFFPGPRGG 
*IPPPEKKIFPPVFRGKFKKPPHLDPGG 
GNFPRKPPFFFIiGPXSFFFLGKTLGGGG 
WIPFPPPWPNLFFKKIFWGAGGPSPF 


703 


14604 


A 


709 


1 


380 


STCKRMTLDSYFTPYTKINSKWNKDLTV 
RGRTIKL IEENIGVNIiHDLGFGSGPSDM
TPKAQAKRKIN/ WDFIKYIiKFCA*EDS I
ERV/N/RRSTEWEKIFA1SIH1Q*GN*NIQ 
RILiKLSKl»*ARLKWAEDL 


704 


14605 


A 


710 


382 


30 


PGTTGVHHHTRLTFVF/ CGRAKI S I CWL 
WNS * AQSACLSLPKCWD* EPQHPARPV 
SFNTRSGIPIISTNPSLSLSAFPSQGFV 
FLFPHPPASHPETALABFFLGTFSLSPP 
GPPTPAS 


705 


14606 


A 


711 


1 


377 


LLVICGMQMKITVRCNr*TLNKIAIIEKK 
TAKCC*HYGGLKTIiIHQWCKSNMVQPLC 
KTV*QF*KKIiWIQLLYLiLK*KICAYIHPS 
NCTQMFIA/ALFSVIAPWWKQSAY 


706 


14607 


A 


712 


1 


390 


LKYKIMIiGMAAWACNPNYSGGPGNITA* 
TW*AYMAIiTRLEPGRGDHTAALQPGRQS 
TTPF*KKKAVPFRA/RPVKMREREI*KP 
FS PE \RT YS CAQEGPGRTFGS AQDLEAA 
GGRGHHRMGAVWQEPHRLLG 


707 


14608 


A 


713 


2 


393 


KVRRQI IIJVDTATDS IGIKWI I WKHYV* 
IiYANKPNYLAEMDTYVERYKLP \ KEIDV 
MNSLVPAKEIXVWKILLTKTPDPHIFT 
NSVYQTFKEYY/PVLYKLFQKIEDSRTF 
LS SFCEAG INL I S I CKRFFF 


708 


14609 


A 


714 


219 


1 


PRPGAVAHACNPS TLGGQGGRI * TQSS I 
IKACCLIHDLAIQTQKQMTXWPGAVAHA 
CNPSTLGGQGGRITRSGJiJ^DQPSQHSKT 
PSLLKIQKKLASWAGACNPGYFENYIQ 
KLIiRGPGT 


709 


14610 


A 


715 


381 


126 


I IAGGGGVCT*SQLFKRLRQEDHL,SPGV 
RGYKEL* S YPCTLAWVTE*DPVF* KKNY 
/TDNTSPKETNKK*GVRCIKTGiCVXjIFV 

AAG 


710 


14611 


A 


716 


417 


1 


CYCCWGATNKILLLTDNASGRQIVLMDM 
YKKNNWFVPVNTRSFVQLMDQGI ILNF 
KSYLRN/TF/HKCIAVINSNSS*ESGQS 
KIiKALWKGFTVL YAI QNI CDSGQVKMPT 
LTEV/WKEVILPLMGDFVRKRTSEKCaNY 
KLS 


711 


14612 


A 


717 


406 


122 


MFSRDAVSLCCLGWS * TPELKLS S SLSL 
PKCWDyRGEPPCSAAKDF*IIRKR/HSC 
R]SriIRNSDAWQR*TTLVSYDRNPP*FPS 
PPPNSPLGPAA 


712 


14613 


A 


718 


535 


1 


HNLNKIV^NLHVENYKMLVKDKKT* ISC 
LWVRR XliVKI PVXiPKLFYSVI VFP 1 RSL 
VRYFIEVCTFTLKCIPKEKDLKQPWRF* 
KKKKKVHLPFIKVYY\KLQ*yiATVIKT 
VWY/W/ YSKIDKAKWYKTEKAEV* PHEY 
DQLIIiAEVQINH\NLFNKCY* SN* TTIG 
KNMNLNLSFTSYT 


713 


14614 


A 


719 


255 


1 


GKLiQPNKGIiISLIY\NL*KLVRKSNPKE 
KWAKGMNR*LTBKGIQIAPRHMKRCSPR 
FIMKAMHI I TTWS YHPSSKRKTR I QRLT 


714 


14615 


A 


720 


381 


0 


KFIAVNEYI IKEIDLPINNLTLYL.KEL./ 
EREELTLCKASRAEVI\KIRAKI IRELT 
PIRKTDNQSWFFLKIHKIGQ/PFKLD*L 
QKKKRVREKVPXTKLRYENRVITTNFTE 
/IKRIIREYYDH 


715 


14616 


A 


721 


3 


381 


lYREWRDLTRDGTYQGHHAPPGEKG/VC 
RYGI ILF ITSEGFFPAGFP* PFNHCSLA 



PTPQLGGPWPPTGITPLNPLKVPLIiDTS 
VLLASGES IT*AHHS \LIENNRNQLI QA 
LLlTILIiGLYFTIiLQA 


716 


14617 


A 


722 


56 


455 


KTI INI KPPAI IQYQTPLFV* SVL ITAE 
XLVLLSLPGLAVGITILLTDRNUSTTTFL 
DPAGGGDP ILYQHLF* FFGHPEVYILIL 
PGPGI ISHIVAyYSGKKEPFGYIGMV*A 
MISIGPIiGPIG*AHHIFTGGXA 


717 


14618 


A 


723 


398 


3 


HQLPVFWQYNKKAWTTRPLFVDWLH* CL 
VFE VS KYPAS KGIiPFKWLMLDNAPD /H 
PTQNSMSS /DSKGIEVI YLPPNTMS LI * 
FLDQGVIRTYR/RHYPQYSMQRS IRPMQ 
E I SNKENI I KVWKNSTTDDAl VA 


718 


14619 


A 


724 


318 


14 


TPPFFIjNIPNLPFIjKFPSPPPPWFLTKS 
LFFPLKXJCPKVFGK*SIjSKF/CPFPKFN 
PKPQKKNFFFPPPPFFFFFFFFLRHNIi/ 
VCHPAWSALAPSQIiTATSAS 


719 


14620 


A 


725 


2 


371 


APRCKRFSCLSLPSGWDYRCMPPRPVNF 
F/SIFSRAGGLES*PPMIPPA*A/FPK/ 

CWDYRHE 


720 


14621 


A 


726 


430 


97 


PKKNPFWKRA*NGGAQKDVPLSLP/SGW 
AGGS P WGPGF * PPLGP KGGPPFSKKPPG 
GVGPFLMPPTPEGGGGKLA* PWPPGFNP 
PVSIFLPRPPRAGGKKGTPFPKKKKKEK 


721 


14622 


A 


727 


406 


1 


GGAPPPPFFFFPFIKKGVFPF*QEGFFF 
PGLKN/LPP*PPKKVGVKGGSPPPGGFF 
FFFKKKVFFFF PGLRTRAQ I KVPSTLVG 
QG*KILPPQVSQEGGVKREGPQAQLFFF 
FFFFEMESHFVTQAGLQWRDIGSP 


722 


14623 


A 


728 


98 


368 


KPHKNAKCP IRVAI VADMENNKYW* GYG 
EMGTL/LHY*WEGISriEQPWE/T/VWQLIi 
EKLNTELPCDPATPLLGIYPKTLESRDS 
NRFFIFFFSF 


723 


14624 


A 


729 


19 


404 


VCVISICLATEILFFLHTRPCVCIYCVC 
VCVCIiAWGTLCVYVCIESCVCVCFHLT 
GVLC/ V* VCVI S I CliATE ILFFLHTRPC 
VCI YCVCVCVCLAVWGTLCVYVCIES CV 
CVCFHLTGV\ C VC VCVC lES S VCVCPPT 
CGTVCVCVRPPGVLCVCVCVCQAAWVAV 
SMCLTP/CPC/VCVCVCVCV 


724 


14625 


A 


730 


432 


12 


FFHKPNFPAARKGRLFFPIYPSKCLISP 
KAL*F\FGGVGPFFPPPKEGFFPKVPR* 
CFFRPLIRKKQILFLPPLNLAPPGVI *M 
/PPPPIVPFFPFFKRDRVSLYCPGWSAV 

A*S*L/TAVRTRGLK*SSCLSLPKCWEY 
KCEP 


725 


14626 


A 


731 


332 


7 


NKTRKEKFLTRSISRF/CFPPFPIjKIFF 
FPLRA*FFWGGLPQIFPPPKKVFF/PKI 
PKVFPKWPPLRKKIFFFFSPLILGPPRV 
LLKGPPLPFFW*VPFF*DRALIiCHPY 


726 


14627 


A 


732 


70 


294 


FLRPCVETAIKKIAI IIiANP*NPIj*RWG 
\PTMPPRIiVSKL*DSSDPPTSVSQSVGI 
TWSHHZU^IjNFFYYQNSRR 


727 


14628 


A 


733 


1 


355 


FLLRHILLCHPGFCSVATAYCSRDPGSS 
D/PLPPQAPLPDQ* PRLQA/WHRLAPPH 
SANFF**RQGFTVIaARMVSISQPCDPPH 
SGPQGA 


728 


14629 


A 


734 


50 


395 


IPGIiTRQWLiLDPCASPSTPPYT/P*VQP 



STPQNSSPSPKTHNQKGLPMPLSPTPKP 
STAWKKAILEHTSSSSSSSSSSSTCRWR 
NGYTYTVPEHRPARGHTASQTRKQVLAA 
THKP 


729 


14630 


A 


735 


451 


0 


GPS S S PTNI PT P VS WNSFK* S S QVAGAT 
GTCYHAWLIFVFSVEMDFY/HPVA*ASQ 
SAQLCQ YFMI FLRQGVSLCPPGWSAVAW 

SKLTT/CLCLPGSWDHRCAIAHPANFLY 
FWWR/H3C 


730 


14631 


A 


736 


2 


477 


PSHVSNKRLISKIYKBLIHIiNSKKVSNP 
I*K*GQSLNRDFSKASFriPP*KC\QiyiVN 
RYIKRCSTSLI IKEMQIKTTMRCHLTPV 
RLAIMKKSKDNKPSLVWDC* IRVLHLF/ 
CK/WICQYVQL/FWLLLSNFLVKQPF\Q 
IiPAPPWSPTLG 


731 


14632 


A 


737 


3 


2344 


AAGGPTAQSPAQLAGRALRIiARWRAVAV 
GACRPGAGSPCSVQGGAASELSPRPQTW 
IGSLKP*TFGAAAG*AHRGCGGSALLN* 
ATPRPAPGLPASPTSSQALPAPLGAWGH 
SDHQPRAPP * SPQASTAIRKEKKQRAQP 
GRAS VCPASNPP I S SRADPVLQHGPPAI 
SGAGSAVASQAPGSS/GSHAESGSPALA 
HTP*GS*EPHSLIVESTRKS\ELPSSSQ 
GRLLLPLLTPGVAS /PVGTKLPGATAAT 
AGALHQPGLRLSSLQGVGGAKNKQTGCC 
CLQLPTTGLPQAPGALRPLGRLGPAMP 
GEHRQRTSPQGTVPYGGIRQGWEQPQRL 
RAYGTALPPPHTPPGSSGP/RQAPGCRG 
SGAGEAAGIRDTGGGGPGRRAPCSQASP 
GRGGWQAQVGCETCRGCAQSS /GGGAVQ 
PGLPRKPYPHS/AR*ENLWPFPCSPRT 
RAQEPQTQGEE/GVEGPQGSPCAPGAVR 
GRGIQLSSEPGKLARQG/PASGDGP*EG 
TGQEPSQAFSSARHPWPSEARMPDTAIC 
RNQAQ * LQLT S S Q P SMGP QLRKS LPAT P 
QPNSYWDGGNSATLGRTTNTRRHCGMSN 
FGARGDLLGTVPT* QPLMQRRKEKPRVG 
GEPVQSHTVCG* \ PAGVSRGWPLRPMLP 
ERWRPLSASSGSQGRPGLHPPSLA\CGP 
S SSPHRTCS P /GLDPGLPDAGS I KPPS L 
VGAGQGAGST/GLD/GPGLSLLSPGKSL 
LPPSSPATGLSGLGWAQSAF*SLLTVA* 
WLNPVPNGPSDTADCTPAQAPTAPAMLE 
NQANKSDFFFH 


732 


14633 


A 


738 


37 


450 


NQKKWPPPRSSKPARPLFFFL *QKYIKD 
P^KTYQGCFFFSPPPKKKNPPPPKGGGF 
FFFF * KKKGGGP P P /QAKKRGGGGPPQK 
GGAKKNPPRGFTRVFKGPFKKRGPP 


733 


14634 


A 


739 


2 


426 


QEFVLGS APGCGC W VGGEGRVS S PWGDR 
KVGGPSNSRHWNKGAGPSCPPGPSLAAG 
LPGRRAGSWAPVLW\PGPP*GLAQLPSP 
LW/PPSPPQEREPPLPD 


734 


14635 


A 


740 


21 


392 


AQSFKSSLGNMERPLPYQKKKKKKRPGG 
GA/RPLETQPPGGPGGGEKKGGKKKTSR 
P / IRGKPPFL* KKKN\CLAWGGGPAL* P 
QFFGGVGEKKTFNPGGGGLGKPKPPPFT 
PTRGKNQTPQKKKKK 


735 


14636 


A 


741 


1 


405


P S SQGG * E AGG * LE PRS SRP AWAI * RDP 
/GSPEKGKKKKKKKKKKKKKKGGGGFKK 
TPGGAKIKGGGEKKNFFFKGGEKKNLGG 
ILEKKLiFFGGGKNGEKPPKKKKGLREKK 
KF^RGKGGKKAENPGGKKI FQKKKK 


736 


14637 


A 


742 


394 


1 


RDPhEEAVCPFSDLQLRAGRTTALiFKAV 
RQGHLSLQRLLRLFVCLiCPAPRGGAYRS 
RQASLNRGGFHPVRAYLLLCLPKQASEM 
AGAPSPA*LPPCSLISDCCASNQ*DSVG 
/VGPSEPGEGYNLWRRFLS 


737 


14638 


A 


743 


384 


1 


FTGPYLKKNGYSFIiFDYPPPSVLGNWQP 
PHPGRVFPP*DVNFNQW1FVSIj/ISSLK 
DEPAFSSrLYSF*QHFFFITYLPPP/CF 
FLIESCSVAQGGVQWCNFGSLQPPPPGY 
KRPSCVGLPSSWDYRCV 


738 


14639 


A 


744 


2 


410 


TPtiNPLEVPLLNTSVLIASGVSMA*ADH 
SLIENNRNQI IQALVITILLGLYFTIjLQ 
ASEYFESPFTISDGIYGSTFFVATGFHG 
LHVIIGSTFLTICFIRQLIFHFTSKHHF 
GFEAAV* YWHFVN/ VRGIiFLYGS I YW 


739 


14640 


A 


745 


393 


1 


PPPPPRGRGSPPPPFSRKGGPKGKPPPP 
GNFFFFFKKRVFPF\CPGGFKS*V*GVP 
PPPPPKIFKPKGGTPPPGLWSPLKSKKS 
QFFWVPKPGERKKLFPPFFPFFFLRPCL 
SLSPRLECSGAISAHRNL 


740 


14641 


A 


746 


2 


290 


KNLjCRENYKYE*KKIiKTMKDLCKEYYKV 
*MK3CIKENM*KDKLMNPK\KNIKISTLI 
KVIYKFNAIPIKTPMTPFAE\LK*rLKC 

V*NYKRP*IVKAI 


741 


14642 


A 


747 


59 


364 


CRFFFFFGLGGGGGGGGGGPPPFF*KNP 
PNPPKKIPNWGFFFRGPGFAPKKKKGPF 
LGEPPP WGAPLYNPP P P FGGGGAKFPP I 
FFPRAGGPQKRVG/RPPPPPPPPPPRPK 
KKKNLH* LECVEGTEEPPTRPPTRP 


742 


14643 


A 


748 


2 


371 


SSCLDLPKGWDYRHELLHLAHWLVLGIF 
LLLLFCFVLRW/RSCSWQTGV*WSDLT 
SLQSLLP 


743 


14644 


A 


749 


339 


198 


IADMEKV*AF*TEHKTNHISIiNQN* I*S 
kvltlfk-smkaerd * EGPEEKFGASTGW 
FMRFQE I SQLYN/ IDV*GEAASADGEAA 
ATCAEDPAKI PDEGGS PK* Y I FNVD * AT 
FIRDLSWIFCTSCCSFSISTCCFTLHVM 
L 


744 


14645 


A 


750 


249 


2 


KXFKPPAFPVSFPPFPL*DSSSPPEP*I 
SGGRVGPI\FPPPKKGFFPKNPPGVFFS 
PPLRKKIIiLVPPPLNLGPPKGPLKRPP 


745 


14646 


A 


751 


1 


288 


WNHMTDEGLVSRI Y* QPLQL \KRKTEN 
PVQKWTKCPNRRFSEEDEQMARKHKQGW 
PASRAIRGTQSKIIMRCHPTHTRMARXK 
KEKEAGPGGSCL 


746 


14647 


A 


752 


494 


2 


ETGSWFGPQRIiEVQWWIKSSLQP*PPW 
APVILPPQ/XiSSG*DHRHVPSRPG*LKK 
KFPL*RWGLTMLPRLIiLNNLPTSASQNA 
AITGVSHCARSASVFLILRQGLWVWEB/ 
GAQW*SAIiSHLPV* /IiSR*WCQP*PLG* 
DSGSGSPLWRRriAYr.PLCAriWKEVTMCS 


747 


14648 


A 


753 


1 


533 


YIiSPRLECSGGITAHCN/LRPPGLK*SS 
QLSLQ\SSWDCRPAPPRPAARLIFFCRF 
EEGRRSHYVAQACLKLPGS\SNPPTVA/ 
F P KCWGYRCEP PHTGLCS / GLS E FGF CL 
WWGACS RS SHKNPGCGTA/ CPAVFLHS V 
GYASHPSSTSRTAFTLGWVIiVPVFRDKG 
LTVFSQAGSTVGVILG 


748 


14649 


A 


754 


1 


346 


ELSKSTLPAIiCGGNDKAWMTAHRFIAWF 
TDYLKPTVEISIYW* EKKIRLKVLLIilDNA 
PGHPKALLEMYTEIHWFMPADTTSIL* 
PTHQGVISTLKFCYL*NTFHKVQ\CYID 
SDSS 


749 


14650 


A 


755 


1 


323 


EDQTSHNVIiLSIiNLIQSKALMLFNSMEA 
E /R/SEEAVEEKFE ASRE * FMRIiKERS C 

LQNIKVQGEEASADGEPPACDPEDIjAEI 
TDEGD C I K* Q I F I VDKTAFYS KKMPS 


750 


14651 


A 


756 


1 


344 


ARQQPGDPARGGGRGRRL\QPRQAASFC 
TGTLARARHTCBGAGVLRRPADKIASLN 
*HPSQKKKKKKKKKKKKKKKKKASSSS S 
KKKKDSSYSSSSSRGGSSSSSSSSPPPP 
PPP 


751 


14652 


A 


757 


119 


317 


NRYFPFIiRVKKFPWVEGPFFFLR* SLTL 
liTLVAQAGLQWRNLSSLKSPLPGFKPFS 
/PSA/IiSRWDYRR 


752 


14653 


A 


758 


2 


363 


CILiAIVKSAINRIAN*YI*KCSMSriTK 
RQA/ 1 KRKNKTRR YQL I P VRMTL I KKKK 
RW*RCEEKGRLAHCWF\ECK*RQPL*KT 
K*RFLKKLKIiP\FITAIALLDIYPKQIK 
SE / CKKH/ CAILF I ALFT lAK 


753 


14654 


A 


759 


358 


1 


KADFCFPPTPSPRVKKIFPPPFPKKWG* 
KGP/LPPPGENCVFFFFKKKRGFSPLGG 
GFFYFFPPGFPPPWPFKKLGFKGLTPPP 
GPPQKILVFPFFFFFFFFE/DGVLLCRP 

GWSAVAQS 


754 


14655 


A 


760 


316 


2 


KATRSQD IRRI QVKLKE I EPQKPLQKTN 
KSRSYFF* KH* ^NRTLTRVMENKREK/N 
EIDPIQNHKEDITTDPTEIQTIRKYYKH 
LYAHKLKKLEEVDKLLDTNTL 


755 


14656 


A 


761 


3 


321 


FPIiGLWGSIiGAKPEGQAPLYPPTLSKRV 
VPLC/DKGRSAVTFNTVYSSGSSRWFP 
PPCM*IYRLCIKGRSSDPEQKKKKKKKK 
KKKKKKASKKKKAPKDSSSPKT 


756 


14657 


A 


762 


315 


2 


KTERW/ VFEKINKMGQS *DGLTKKWREN 
/V/QINKIQNERGGLSTGSTEIQRVIRG 
FCEPL*AQILDNLEKMD*FL.QPFHLPRQ 
NYKKKKI I *KKPISKVIEDVIKNL 


757 


14658 


A 


763 


226 


329 


NPTTLEGQGGRISKGPEF/CSRDEVSPC 
WPGWFQTPDLWRSVRLGLPKWWDP/RR* 
APPPGLNXiFFRMLSTHGSW/CQHPCFKF 
PTSAFQYIYIYIF/SFLSRSFTFVAQAV 
VQWHDLGSPQPPPPGFKQ 


758 


14659 


A 


764 


335 


1 


QSQERPDS FQVMNIiTLPRAPSMFL VSPR 
TELSKSP* \PPAFLCPCVWPHSTLLSQT 
LKY*IKYP*SPTPFPSFVLF*RDRVIjLF 
HPGWSTLAGS * LTAASNS WAQVTLL 


759 


14660 


A 


765 


100 


310 


HFGRPKRENCIiSP*V*DQTIjHTHTHTHT 
HTHTHTHSRSENGGMRLTPHIPERVRW\ 

Girc 


760 


14661 


A 


766 


315 


3 


I FPNPPRI FI QNPHPFLFGFFPKKKEGE 
KNWAMETPRFSPPPLGGKKAPTGFPPPP 
IGF*GPKPLIKGRGP * /PP * K3SITKI FPF 
FFFFF*DRVLIiCHPG*SAVARW 


761 


14662 


A 


767 


1 


309 


KEPYKLTELHNEIRVACGHLSKKKKITW 
VSDS*ASGPSKTRGNFIGPSAGSRAPSE 
KLE AHFHGCGS VNRVHLT / CK* HKRRP L 
GVAHLEVSDQKSGRTSIjALD 


762 


14663 


A 


769 


1 


316 


GRAPPPPPPPPPTPRVHTSSKSESEPER 
/DGRE*EPTRSLERL/LPFASYFCtiEAR 

QSTSAI.PPLPLSPRSGPT\VLLCPSPPL 
PGERPPNPTKLSREKQTKQTHRAR 


763 


14664 


A 


770 


2 


366 


AREIIII/TF*QMITFLNTKGRT*SXiIIiA 
SLIIFIATTN\LLGLLPySFTPTTQLFI 
NLTMAI PL* AGAWXGFRS KI KNALGHF 
LPQGTPTPLKPILPIIETIILLIQPIAL 
AVRLTANITA 


764 


14665 


A 


771 


2 


364 


NAFPVI SGAWTEYPLSHFIQRCRQKKE/ 
LNGLQIGKEBEIFLFKDAMIj* lESPKES 
VQKRLDVIYKFSRIAVCKINIQKSNIW 
rVWNTQFENEVKTI *DPQ/DIKHWGIIL 
SQRKKE 


765 


14666 


A 


772 


3 


376 


HEPLGKIiKLSLLFIi:iATYSIiTVYSIL*S 
G*ATNSNYAXjIGALRAVAQTISYEVTLA 
I ILLSTLLRRGCFITDSTLMTTQEHL* LL 
LPS * PLAI I * FI CTLAETNRTPFDLAER 
ESELIiSC/FNIEYA 


766 


14667 


A 


773 


3 


350 


HEFFFFUjKMYLGQAQYFICFLFFFLGF 
HLK/HKS CS VTQAKVQRRHLGSMQP P P P 
GFMQFSVAEIiCDVHHHAQLIFYIPRIFI 
F /NFLRHSLALVAQDGVQWRDLGTLQ 
PLPPPG 


767 


14668 


A 


774 


1 


359 


GTRYAAMLS ALGF I FLFTARGLTGI VLA 
NSSLDIVLHDTDYVGAHFHYVLSIGAVF 
ALI P \GF IH * FPL FSGYTLDQT YAKIH\ 
FTIIFXGANLTLLPQHFFGLSGMPRQYS 
DYPDAYTTW 


768 


14669 


A 


775 


2 


369 


ARGSICLRQTELKTVIAYSSXRHIGLW 
TAILIQTP* SLTGAVILI I AHGLTYSLL 
CCLANSNYERTHS\RIIILSQGLETLLP 
LITF* *LLARLANLALPPTINLLGELSV 
LVTTFS*ANIT 


769 


14670 


A 


776 


2 


353 


ARGTGA* VDS*LTTLHG^NMK* CAA\LL 
\WTLRCKILFTVRGLTGMAITNSTLDIA 
LHDTYYWAHFHYVLS IGAGFAI IRGFI 
H^FPLFSGYTLDQTYAKIHFTIIFIGVK 
ITFFPQ 


770 


14671 


A 


777


3 


353 


HEGLHL*LPKAHVETPMDG* lALDAGLL 
RLRGYGI IHVTLILNPLRKHILHPFLVL 
SI*GIIITSSICLRQTDLKSLIAYSSIS 
HI\SLEVTAILIRTPL\SFTGADILIIS 
HGLTCS 


771 


14672 


A 


778


367 


2 


FCPI/CPNQKFM6VGAVGPALYPNPFGG 
LGGLV/ TPGAGVLWPAGPPG* TPPPPKW 
PNLGGGPALFFPPLKGLGWKIALTPEAK 
GSINPNSPPALPGGGPNQTFSKKKKRKK 
LMLVYSIELTSRA 


772 


14673 


A 


779


3 


432 


HEPLHWLSSCVCPACVCVCVCQ/ CLWLQ 
VGVQ*ClSrYSSWQHEPLH*VHTRVYVCVC 
LCHSVAQAGVQ*CNYS/CTAACNSW 


773 


14674 


A 


780 


213 


466 


DDILPVWNYSISFLFFPENRAFFLPPGW 



GAGTQIj*LPAASR*GPKGPSRLiTLL*W\ 

DNRHGPPC / LDNF 1 FLQKKNLTLLPRLA 

LN 


774 


14675 


A 


781 


304 


373 


R*VFFFFFLKKEFHFFPQAGGQNQEMG* 
LHPLPPGAG/RF* CLTPPHSWDNGEGPP 
RPPNFCFPRGNKVSPCGP 


775 


14676 


A 


782 


389 


1 


PFPKKT* CRWSRPNVKHKTI/ QliLEGNL 
GASVDDIjEFGDDFIiDTI PKVQSMKERS / 
WDFI KIKNVCFAJKDNVKRRGPATNWKKI 
SVKDLSNKGLLPKIYEELLKLNNTET*R 
LT* KWSKVLNRQLTRETRA 


776 


14677 


A 


783 


1 


383 


GTSP*PL.TGALSYLtiMTCGIjAM**HFHS 
ITLLILGLLANTLTIYPGRGDVSRQSAY 
QGHHTPPVQKGLLYRIILFITSEAFFFA 
GFF* S F\ YHS SliS PTPQLGGHWS PTGI A 
PLNSL*VPLLNTWRLL 


777


14678 


A 


784 


1 


389 


GTS IVIPTGViCVFN*LATLHGSNMK* SA 
AEL*ALGFIFLFTGSGLTGIVI.ANSSLN 
IVLHDTYYGEAHFHYVLSIGAVFAIIGG 
FIH* FPLFSGYTLDQTYAKIHIiTXlIFI 

GVNLTFFPQHFLGIiSGNA 


778


14679 


A 


785 


374 


1 


ELNAYW3WiyiNL.QNIi I WNAQ PL S IMQ I FQ 
I L I KS Q I QNTL WS I SDTGYLPG I DKWN 
CI^LKILCIVKVTINRARMPVIDWENTF 
S TYTNDKGL I P KYKELKHS \ KQ TNNL I K 
KWAKGLHSHSRA 


779 


14680 


A 


786 


1


363 


GTRL YHANTN * KKLRVAI L I SE K\ TD FT 
VKKIRNKEGHYIMlKRSIIi*EGITILYV 
GTPSNRWNYIRQKIiIKLPGEIDESTII 
liRDFNTPLSVlDASTRKKISKNIVSSNN 
IISQLDLID 


780 


14681 


A 


787 


1 


361 


gtlfsserknptwvslnqklemikliee 
gmlkaetghklrllqq/tsqwnakekf 
lkeiksatpvntrktrk*dslixadtgkv 

IjVACIED*TSHNVLFS*SLIQSKALTIiIi 
NCMKPERGE 


781 


14682 


A 


788 


1 


352 


GTRNYAKSTKSKLYRWNYKAWMTAYLFT 
AWCTEYFKPTVETYCSEGL/SLKILLLI 
DNASSH*RALMEMYKQINWFMLDNRIS 
LLQPVDQ*VILTFKSYLRNTFHKAIaAAR 
DNDSSD 


782 


14683 


A 


789 


365 


128 


PLDQHGETPLLLKIQKEARRN-GAHLYSQ 
LLGRLRHENLLHPGGRGC/SHCTPAWI 
E ^DCLKNQTKTKAPRRVSISHSA 


783 


14684 


A 


790 


217 


257 


WSGGVAHPVIPATQEAEAEELSLGGRDC 
SEIiRSCHS\CTPA*VTQAGVQ 


784 


14685 


A 


791 


2 


363 


LAILHQTVS *FVHAKEKFWKDLL \KSAT 
PVAI *MIRNLNSI.YNLIMETV* W*KED 
QTSHLIPLS*SLTQSKALIFFKAMKTDR 
GKGAVE * KFEATRGWVMRLKEKFCIiS /H 


785 


14686 


A 


792 


2 


362 


GQKLGIjLHQTVSKFVNAKEKFWKELL\K 
SATPVDI *MIRNRTS\LISDMETV* W* 
KEDQTSPLrPIR*SLTQSKATOLFRAiyiK 
P VRGKGAVE * KYEASRGWFMQYKEKSC/ 
RMCNIKIQG 


786 


14687 


A 


793 


2 


360 


ARAGSTMAFKNYAKSTLPVLYKWNKKA* 
MTAHLFAARVTKYVKS / YCLNKKI PFKI 
^VFIDIVPSHPRALIGIYKE/INVFMPA 
NTTSTLHPTDQEVISTFKCYYFKNRFRK 
AXQLPYSNSY 


787 


14688 


A 


794 


2 


359 


AREKL YLS ILTPLQRSL VI AFAATELTR 
YYILFETTLI PSLAISTR* GD\ QPERLN 
AGTYLLFYTLVGSLPLLIALIYTHNTLG 
SLGI LLLTLTGOEIiSNS * ANNL I *IiAYT 
lAFIVKIP 


788 


14689


A 


795 


157 


365 


G I lEEKG YLPEQ I FNAKCSGMI S AHCS P 
RLPGSSYSTSA*TDF*CQVQWYDriSSLQ 
PPPPRFKIiPYLSDPSTWDYILAPP/RPS 
**FVFFVEMGFRQTPE 


789 


14690 


A 


796 


224 


372 


I FF I F I F I YLFTE WRQSFALVA\ QAGVQ 
WRNLGSL* PTHPPPRVKRLSCIi 


790 


14691 


A 


797 


482 


1 


NSFFFFFFLFPEAKDHKDFPSPPFPVFK 
KLKGFC*APP*VQKPVFYLKSF/CLKDY 
MLSFPESKVNFPSPPPL/SPSPLFMVGD 
PPLPPETYFSFCF*DPHSINFFLPPGPF 
LPHFFFFLRWVLLCHPG *STAV* SLLTA 
TS\VK*FFxHLSIiPSSWVYRCTSC 


791 


14692 


A 


798 


1 


415 


NLGGGGCSELRSYHCSPAWATE*DSISK 
QTKTL /NKDHTRAGWERA 


792 


14693 


A 


799 


2 


401 


VQTGFHHVGQAGL * LLTSGNP PAS AS / Q 
SAGITGMSHRARPQ 


793 


14694 


A 


800 


73 


307 


PMALEHHGCGMCLDFLPTFGKSHCFVLR 
CAEMETRSFLPSWSAGA*Y\CLLQPPPP 
RFS*LRLPSRWDYRHLLPCPAN 


794 


14695 


A 


801 


87 


401 


SLIEIWTLKKSTGHNLYNWLS *HSLKA 
I VLARHS / VESLTHVLKiyiCLV*NFVFAF 
S*RSLCFFLKI SRP VWAHVCNPSTLRG 
*GGHITRSRDRDHPSQHGEIPS 


795 


14696 


A 


802 


3 


354 


LRHYTP\PG*QSETLPFKKKKKKKKKKK 
KLiSFFPPPKFLKKKKGLFKNPFKKKGKI 
FFNPPPPKKKGFFLIVNPPPKKKNPPPL 
GGGGPPICKIYF*KTFFFAPPPKFE1^PFF 
FFSPRF 


796 


14697 


A 


803 


1 


830 


VETGFLHVRQAGLKLLTSGDLPTIiAYQS 
AGITGVSHCAWLFFFF**GIAVTQTEVA 
PS *LTIASN\PGLKIiSSFFTLPHHARLI 
FKIFSRNEVLLFSRX PSQTPNLMQSSCL 

slpkcwdyrceplypaeslsfXfiklsc 
mlkclevkcndvcnli.*krtlikwinewi 
egwigrltcvkqieknvicri * VMLKAT 

QP VGTE PS * ESRSTNSGBAMW * S KDNTP 
FPVFVSVFVLRQS/liCFVAQAGVQWSHL 
SSIiQPLSPRIiK*SSCLSLLSSWDLRARA 


797 


14698 


A 


804 


389 


3 


FIWPFKGiWPENSFFVFMWIKGHKCPKKG 
FGRKKNQL6PKIPF * KKKKKGAFPLWLK 
RFQGPFFFFFEMEFHS\IAQAGVQWCDL 
SSLQPIiPPRFKRFSCLSLPSSWDYRHAP 
PRP2^F\CVFSRDEVSPCW 


798 


14699 


A 


805 


161 


21 


RGGGCLQSQFR/RRLRHBNRLNPGGGGC 
TEPRSYRCTPAWARE*DSVSK 


799 


14700 


A 


806 


405 


82 


FFLRWTL/DTVTRGGIQWCNLGSPQPP/ 
PPRFKRFSCLSIiSSSWDYRRPPPCPANF 
LYF* *RRGFTMLARLVLNP*PRDPPASA 
SQSAGITGVGHRAWPMPIFENRFDL 


800 


14701 


A 


807 


938 


2415 


KITFWETFWITXVHPHLCKEREAIAGIIi 
RRWS *NNW/MIHTHTKNLNPYLTPDTKA 
TFK*IIGLT*RAKTMQIi/LESFCQKENT 
GENLSDLGVGKDFLRHKKHGS IKGEKIA 
KLDFIQVKNF^SLKDTFKKMKKYALGWE 
KIFAERVSDRGCVSRRYKEL*L/IELKD 
NPIRKGGNNDNKTOQRI *MANKHMKRCP 
KS*VIRE I * I *TIMRYHCILPRiyiA\iTVlN/ 
SD*SHGDKNGGSSGTLIH\CR*ECEiyiDQ 
LI.WKTIGQFLSK*M*RCHTDSSIPFLII 
QEK*NCISTKKTCTQTFTAA/MYIjLVIA 
KlSrWKQLPYPSSVWl/QQIWCIYT\MEYY 
SAIKRTVDTLNN\QMDLKIIRFNE^S/H 
NLRRVHCMY 


801 


14702 


A 


808 


414 


1 


KPKAKLFVPVEQVVNAKEKPLKEAKSAA 
PLN\RMIRKQTSLTADTEQV*VIWrEY* 
TSHSIPIjSQSLIQSRVLTL\SMKAETGE 
KA2^ENIjEDSRG*FTRhKEKSIlhHNlKV 
QGEAVSADGEAAAGYPEDLAKI IDBR 


802 


14703 


A 


809 


399 


1 


TGPPPRNQPFNKPAKGNFWLFLFFFL*F 
/CNFFFFFPFTGAFIiSIiFGI \liV* CLML 
VWGNFSP*PFKYCFTFSFFLSFIFLIFL 
WLLLlSfTCLYrCYTFCSYLTVL*CSFLFS 
S IFFLLYF* FWRLLLIHPPPTRP 


803 


14704 


A 


810 


1 


392 


ILLiAGAIEDAEPGSG*TGYPTLTGSYSH 
PGAYVYLTlLSLHIiAGDSS ILRAINCI \ 
TTDINIKPPAVARYLTPLFV* PVLDTAM 

LlilLSLPVIiADGVT ILLTDRNLYSTFFD 
PAGRGD P I LYQHLL * LLRH 


804 


14705 


A 


811 


66 


411 


RETRAGAQGLSRFSPPWGNPSQKI FLGP 
GFKKNPGPQKKGNPFFFF*KKFPLVSPG 
GRGGGNFRSLQGPPPKVPPFFCFNPPRK 
GG^/MGAPP 


805 


14706 


A 


812 


3 


392 


LSVPQFPPFFLKKTFRNCPPFCFSPPNI 
LGFPLKGS KK* GS PFPFFPPPWYS FFWF 
PSYSF*NQPCLTRGPF*NQKFPRL*KGV 
PILRL*SPLFFGPPPHSF/IFFFFDRVS 
LCHPGYSAVAQS /LLTAASS 


806 


14707 


A 


813 


690 


153 


iisidakkafyeipppsvissssptdsl 
yynlgflknfkkgre vkylni ikyi yek 
ptani i fi sgklkafslrsgtrq i cpls 
plqfnkvlevlaratwqekeirnihiek 
ee vkls lf vsnilci ez^rplkmp * ktts sn 
* * iqrvagyktisii*k:is\aflytknklf 
cclshpiydipl 


807 


14708 


A 


814 


32 


376 


IiFGLARSYITEGGRIiPENPTIPHG*REF 

welcnkcdXtmrpkpslhgsrcghcvtr 
mdhhcpwinngvgedkh*lflqlcfyte 

LLTCYALMI S FCHYYYFLPLKKRTLVRN 

VYI 


808 


14709 


A 


815 


2 


423 


YPL YPFKI FI FPKGFNFCRE VGP I CPPP 
KI KVLS KNSQVGF YTAP YKEKIMTLPAR 
VNF6PPKDSLKRPPLFFFFFF*RDGIIiL 
CYPSWS*TPGLKQSRCLGLLKHWDYIC* 
PLHPASF*KKHYSLRILP/SLLADAW 


809 


14710 


A 


816 


404 


54 


SLAHFFPPPKKGFFPKIPRGVI,*PPP*K 
KKFLFS/HPPVNLGPPRDPLKGPPPSSS 
SSSSSSSSSSPGG*MIFY*/P**FGPAIj 
VQPMACYCKASCKVKAPFFSTHSGLMVA 



PPTTAQS


810 


14711 


A 


817 


3 


410 


DAB VGGS LB PG \ S GGCS EP * S CHCTP AW 
VTG^ETVSKKKKKKKKGFFPPFFFKRGP 
FFRE I EKKFGKKF F S KKKKKKKKKKRGG 
PFKKNFWGGKNLRGGKKKFFFFLGGEKK 
KPRGFFEKKTFFWGGKNWATTTQK 


811 


14712 


A 


818 


1 


404 


IPINSLTSKINKLLKRHKLP /HLNQEEI 
DmtTRTXPX*NINF*XYKQNTGPDGSXS 
KFy*TIRKT*Q*FYTSFQKIKNEEVLPN 
SFYEASTPIjASPKKKKKKKKKK 


812 


14713 


A 


819 


421 


2 


lppkrrcfspkpprgfffaps *gkkni / 
cfppr*nlapp6vflkgpplffffffff 
frgrvg*swvpaaspi,qgqvflppq/ss 
* vggspgppeharvnffffffpff*f* * 
rqsltmlpslvsnswa*tiltpqppk:vi, 

GLQ 


813 


14714 


A 


820 


84 


418 


ILTCHNARLGEKSQIVTMQASQNNPKLV 
NICEVFLVFFFWKPIFFGTQVGGQYFNL 
SSIi*VRIiPGYTHFFCLTLPKSWDYRPIP 
PS\P I FFCPFNKKGVSPGYPGGCNILVF 


814 


14715 


A 


821 


290 


3 


FSPPPENIiGPPRNPIiKGPPPFFFFFFPF 
FFFFFFFMPCVLIY\FIFFCTNLLFLDF 
CIT* *T*NWFISSKAVCLFBFLPLQPAS 
QKTKNRCRKSFF 


815 


14716 


A 


822 


416 


2 


ITKLTEEKKIWSGKRBCNDYVKYAGASQN 
YTAWLRPQSMITLKDSSERKAPHWGEL 
NW\HVHHFIWKER*PEV* IYIES*AID 
GGLASWSGPEmKXGKLGAKKV^R-RDMK 
TDPLECTQNINIFGFVCLFGDRVSLCW 


816 


14717 


A 


823 


409 


45 


PPPPFLFFFF*KKKKAPGGGSQGPKF*P 
P PPPKGKP PFFKKKKKKPGGGGGPPNP P 
PPKGGGQK/QFFPPQGGNFNKPKSPPPQ 
PPRGQKKKPPPQKKKKKKKKKKRIiNVTG 
PQFLQLESGTY 


817 


14718 


A 


824 


1 


408 


I F3EWG I KL * INTKRMFRNCTNI WELS 
NMLLNNH\*VNKEN^NKKFIKTNE/NKN 
ITCQNL*DTTKWIiREIYSKKCLLQKKK 
KKKKKKKKKKGGGALimYLGGGQFYGGE 
GNF IFF F WGGAKKP CGWI FRRRP FFW 


818 


14719 


A 


825 


259 


274 


YL* YLLL YRRYSA* I PL/LGYYLKKRKL 
LYQRDTYTP!yiFIAAIjPTIAKMWSRP*SP 
W 


819 


14720 


A 


826 


422 


105 


FFFSKIPWGNFFPPPPKRIFFPPFPPKI 
FFFPPPPFFFWGGFSPFSPPPKKVFFPK 
SPPVFFFPPP*KKKFFFP/HPPLIFPPP 
GFPPKPPPPFFFFFFPFFFPPFFW 


820 


14721 


A 


827 


15 


402 


IKSLGKNIGINI.GDIiVriNNGP*YTLPKA 
QATKEKIDN/WDIiAKAKNI CA* KDTIKE 
VERQPKK* QQTFVNHLFDKGIiVSKHV/N 
EIi^LINKKTNHXHL* *WAKD*NRACSKE 
DVHMTDKLMKDVPSLVIRE I 


821 


14722 


A 


828 


420 


3 


KDAHTCPPGCKGGFSGHRAIiKAI CRNQG 
FIiLTTTEPIjTHKCPLPGQEPWGQHQG*R 
DSPRRVSPAPGTWQPPCHRERACLAPSA 
VE/GPS * IQEQEKSLIiFFFFSIiliRWSL/ 
WSVAQVGVQWRSLGSLQPPPPRFKQFSC 
LS 


822 


14723 


A 


829 


1 


269 


IKIKN]:iARHGDSCl.*SPIiLLEMLRLEEN 
LS PGSRGCS EL / CQPARQGETLSQTNKQ 
TNKKPHQVQLPCFEIINFKEFITCLVLR 
LPGNSW 


823 


14724 


A 


830 


432 


1 


GYNNQQIFDVDQTAFY*KKIPSRTSIAR 
EEKSMLGFKASKDRLILLLGAKVAGDFK 
LKPiyiLVDPS^TPREL]SIN*ARCTLPVL/V 
KWW/KAWMTAYLFTV*FIQYPKPTVDTY 
C/FKILLLIDNAPGLPRAVMDR*EESNV 
FMPVWTTL 


824 


14725 


A 


831 


1 


233 


KKIiQ I KYLGINLMKYT I HWS I CQKLLM 
KE INEDIjNKWRDVtiCLWI *RIiNNiy!S I LH 
KLIN/RINLLQLRr 


825 


14726 


A 


832 


3 


477 


YQTCKEDLTQMLLKIFQKIKGEGIIPNS 
LYDAS I AMMPKSDKDRTKKFXNYRP 1 SL 
MSIDAKILNKILPNQIQQHIQKFIFIPE 
MKG*FNIRK* INLIHLITKMKTKTPMII 
SIGTEKVFDKIKY/PFPITTLTKLGIDG 
RSLDAIMVARE 


826 


14727 


A 


833 


3 


402 


RYQTPLLVX * TVL ITTALLGLSLPVLTG 
GITIPIiTDRRLDTTFFDPAGGGDPILYH 
HLF ♦ FFGRPEAHILILPGWGI I SHIGTY 
YSGKKEPPGYTGMR* AI IS lAFLRVl V* 
AHHI YTV* IDRDTRAYLPSLHN 


827 


14728 


A 


834 


7 


395 


DPQRVSCMALSSNSFFFFFWERKSLFFP 
GREGRGQIWVNGTPPLQGKRNSPASPPG 
GGG I TKKAPKLHPPKKGWGGGLEKPP *N 
QRGGFG I * S PGGKKGFAGG* KQGQKS KG 

GGKKKGRA\DP*AYIPLK 


828 


14729 


A 


835 


41 


444 


DPRVRKKERENL YRS VS IKEIESVI*NF 
PTKKI*GIiDGITSEFIQIIiPKKKPK\IK 
KERTFSNPFDKTN I TL I / S KPETTFTKK 
EN/VRPVSLMNI/DCKILIjKVIANCTIjX 
HI*REIHHDS/lNFIPAIQISFNrQKTJSr 


829 


14730 


A 


836 


3 


415 


HAYYIVKPSP/WIPKGALSALLKTYGLT 
M*MQFQSIKDLRLGLLTNTLTIYQ*WRD 
VTRKSTYQGRHTPPVQKGL*YGIILFIT 
SE VFFFAGIF * AF YQSSLAPTPQLGGHW 
PPTGNTPDNPLEDPILNTSGLLGIGVS 


830 


14731 


A 


837 


378 


0 


TP P KGPGGKI FLKK/ SPGRKI F * PPGNG 
/PFFSPLSPLKFFFFPKAPNFWGGGGPQ 
GPPPKKGVFSQNPPPGKKRPPQKEKKNF 
FP/PPGKMGPPQGFF*RAPPP 


831 


14732 


A 


838 


383 


38 


GPGEKIFLKKPREKKLSPPGEKAKFFPP 
SPLKNFFFPQGVFFLGGGGPKRPPPKKK 
GFFQKTPRGF* lAPQKKKKKNPPPRGKP 
GPPQ/RrF*KGPPPPFFFFFFFFFFFFP 
WSL 


832 


14733 


A 


839 


59 


418 


TKISSKDQKQY\TTPGW*iyiAEISTTFED 
AEIIDALIFPFNLPIWIiHKKPDCSWKVS 
VYCKLK* WSLISATGLDI INT\LLEEM 
KRVSGTWYIATDIiAKTFFYI /PDQKEFA 
FSWYG*KDIF 


833 


14734 


A 


840 


5 


335 


IRHLF*INKFENLDKiyiDTT*KLTYEGIE 
SIiN/RXr*NF/PVVKSLGSGSFTSBFYS 
LFKEEITAIYKPIWRVEKGGI \liPTCFA 
TRITL I PKPETC IMRKKWCGPVLFMNGH 
F 


834 


14735 


A 


841 


1 


38 


RRLKLEDHIiSLY*KINSKWIRELMrrR/P 
ETIKLVKKDKGNVXQDTDPGKDFMVKTT 
KTKIDKSDYIKLQSFYTANETVNRVKLQ 
PIE*6KIFKNYSSGKGLLSMIYNELKQQ 
HRNWITLI * KLEDHLSLY 


835 


14736 


A 


842 


2 


424 


GRVGSIPLSQSLIQSKALTLFSFMKAET 
GQEAAEKKVEANRGWFTRFKERSGLGNI 
KVQ/GAGGDTAASSPADPAKINDEGGHS 
/ KQQ I FSGNETAF Y* KKMS SGTS I AREE 
KSMP I FKASKDRVTIiL IGAWAA6NF\ Kti 
KPMP 


836 


14737 


A 


843 


281 


3 


FFFFPPFPFFFFFKTFLFFGYWFSIFFP 
LLFSISLLLSPSGSFVTFAPQPFF*VSR 
YHYQHFNSTF*FIi\KL*FIF*IFFDISI 
LFLFHGYNF 


837 


14738 


A 


844 


409 


1


RCGGTCP WS P VLRRIiR* KDHt,SLRG*GC 
SEP*LHHCTPAWATE\DSVSKKNKKKGI 
S CRQHI VGS FFFHIiKMCLLNGLLS F FTL 
NVIIYWEFKFTILLFLLYSICSLSHLF 
SFPAKSWINSKFESLFVCFNSFS 


838 


14739 


A 


845 


63 


451 


KNQEQESEETLPNLFYKASITLIPKIiDT 
QK* KKRKEKKRKKKK\EN* KP ITHRNI Y 
AKILNKIIiAKQIQQYIGKI IHHDQVGCI 
P/GAFDKIQYRCMIR/TLQKMGIEGTHL 
NIIKAIYXRPTDSIIENREKP 


839 


14740 


A 


846 


60 


460 


RNNE P FLDQ I VT YFEKW I LYDNR* * / PT 
QWLiDGEEGP KHLPKPNLH / Q KKWVT VW 
WS AS liTHYS FLNP \ ET IT S ENYS QE I DE 
MH* KLQSLQIiAS VlSrRKGQ I PLHDNPRLQ 
VAQP VLQKINELGFE VLPHPP YS PS 


840 


14741 


A 


847


344 


2 


NFLKFHPGGETFLQKGYTGYFSPFTP*K 
FFFSLKPLNFLGRVGPFFPPPKKGFFSK 
I PQGGL I S PPLRGKVFPP P PRLNLGP PR 
VFLKAP/HPFPFFGI\PHGLQPRPMI *K 
PTRP 


841 


14742 


A 


848 


439 


132 


RRVAAPPPSKNIFFPPGSYNCGGVWPQK 
SPPPK*GFFPKP\SGVYKNPPQIEKVPF 
FPHARIVPPPGDPIKTPPPIFFFCFNSI 
NFS LRGAQIiYVWDPLGSGC 


842 


14743 


A 


849 


94 


470 


LNHFLSFFLiSFFLSFFLSSGSLSLSLSF 
FFFWKGSPPPP*GTK/QKPCLiKKKKKKK 
GGGGGQNFFTPPGGKKNPPPQGGGGGGS 
PAQKKKP/EGGGPPGSBOCKNPGGGFKKK 
KPPKKPRGGGGKPFFP 


843 


14744 


A 


850 


2 


396 


ffkkewiqmankhvktyttllviremki 
ptsmryhfts irvakilktdntrc*qeg 
rtirilmlc*lqnkmvliil/wknwqflt 
kikiflfXfdptisllgxypremktyvh 
S SF ihgnskimrtgnnlnvhn 


844 


14745 


A 


851 


369 


36 


skgasdilekpplf*evglptpspplge 
kmp/fnykggpgkgisllifpkkkfipp 
gghfpfpfp/pfffmdrvllchpgwsav 

AVSQIiTTTSASRIiX K*FSCIjS YSEG*DG 
RIS 


845 


14746 


A 


852 


1 


367 


PPPPKIKSPSGPGPPSKRGSPGAPPRGG 
KFWFLSQKGWGGP*FPPPPGVKPE/NIP 
* PPRGRVPL/HLNIzAPAPPPGPPK/PKP 

PSP 


846 


14747 


A 


853 


1 


378 


AMLATL I SNS * PQ/ CD P PALAPQS AGIT 
GMSPCARPTSCSFYP* ILLTVRTFIKIR 
NFFFFFDTGSWFVAQPGGEGHDHG* * * P 
QPPGPIiTKSFWLSLPSRWDYGGAPPCPVI 
LCIFCKDEVSSCGSG 


847 


14748 


A 


854 


369 


1 


SACFGLPKCWDYR/R*ATTPSLEIIiyi 
ER*EKTTSPDNAlSrRFDVTKLHHYDCSVM 
DF*KAI,*NGDDFPVLMSKMHMQSLNI I \ 
PVFCFLRQGLALSQ /AGVQWRDHGLPQP 
QLPRLQPSSHLSL 


848 


14749 


A 


855 


380 


335 


NPGGRGCSELRLCHCTPAWAT\SETIiSQ 
TKIQPKKQEV*LL 


849 


14750 


A 


856 


373 


1 


LRIKILNKKGLANLIP* *R* KI IHSQVE 
PIPEMQA*PIIRK/TPSIKWPIVRTKKiC 
KBCSQMLISlDTBKIFDKIQHLiFLVKPVI 
KLGIEGTFINFIKSIY\KKSTTATIISG 

LNAS PLRLKTRQDF 


850 


14751 


A 


857 


369 


44 


ICVQNLYFEKKLLNKIKVDLNKW/NSLCT 
RFRKLNIANIIMFKLIHRYN/SKVKIPI 
KIPANCFGEIDKLIIjKFI*KFKRPQIAK 
I ILKKNKDGYILPDFKTYYIYMQYHLC 


851 


14752 


A 


858 


1 


348 


QWHAPAS/IiARPPPPRFK*FSCLSLPNS 

WD YRHAP PRLANFVIiFCFVG * GGIHL 


852 


14753 


A 


859 


1 


366 


CTS PTFNQKLEM I Kli/ EEGMLKAETG * K 
LCLLCQKV/QWNAKEMFLKE I * SASLV 
NTTMI\RKQSFTADMEKV*W* ieyqts 
HNIPLSQSLIQSKALTIiYSSMKAERGEE 
AAKEK/LEASR 


853 


14754 


A 


860 


12 


366 


PS TLGLRRAS CLSLLSRWD * RHMPPHPA 
N*KNFFCRDG/SLTiyiLLRLVL\NSWPQA 
IHSPWPPQ 


854 


14755 


A 


861 


424 


78 


NPRPLEGQTASFSIiAPNF*TTLGK:KVNP 
FF/SLKNPPPPPPPPKIWAPQGPFGPPS 
PGGLNGGFFIiPPRVKAPIIHKGAPPPQF 
PPQKKKGPRVFPKKKKKKERKKEKGMTR 

AFCYP 


855 


14756 


A 


862 


327 


3 


SHWFFAAV\GREISM*AiyiAPDQTKKICP 
RSAEDAI KYFLTQATGS I II/IRAI LFNN 
RLSEQ * S ITNTTNQYSSLI I IMAIAIKV 
GMAPFHF*VPEVTQGSPIjTSGLLVIj 


856 


14757 


A 


863 


1 


341 


YDRNKWKDIPGS* lERIjNIVKLSMIiPTV 
I YRFNVIIilKIPMTFFASTEKS I VKFSR 
DYE*PKQS*KEQNWKTHTP*FQ/QFPTA 
TVIKMVWY*NKDRYI Y/DQWNRI * SLGI 
NPCI 


857 


14758 


A 


864 


32 


324 


IjVWPFFFFFFFF*KKKFFPFPPF*RKGK 
NFFFFTPPPPGLNLFF/CPHPLKKWEKR 
APPPPPN* FFFFLKKTGFFFFTQK6LNF 
PPLETPPLTPPKGIjN 


858 


14759 


A 


865 


350 


3 


RVKNPRPFWGF*MVLKPLSFFSKKTN* I 
LFPLKIFSPPKTVPWGKIPLGAL*NPFF 
CFKNPLWFFGF*KIi/SFFFPPLYFF*KP 
IiAPLKRRFSFFFFFFL*DGVSLCRPGWS 

AVARSR 


859 


14760 


A 


866 


342 


118 


GS VTQAE VQWCDHGSLQPSTPGI K* S /H 
PPTSSYQWGTTGVCHPGLAMLSRLVLS 
SWPQAILSPWPPTVQGLQV 


860 


14761 


A 


867 


1 


354 


VKPSP*PLTGALSALLMTFGLTM* IHFH 
S ITLLILGLLSNTLTI Y\Q*W\RDVTPE 
S T YHGHHTPPVQKGLRYGI I L,F I TS EGV 
FWAGSV*TFYRSSLSPTPQLGGHWPPTG 
ITPLN 


861 


14762 


A 


868 


3 


344 


QIKNPDNISC/WIRCGATGMFLHCWWEC 
KLLQP LWKTT * HQ * SRR* AVPLSGMF P / 
NRYS STCTPAS I * KTFLS ALFMMTPL / C 
LSRVEWIK*SWC3SrQKL*y*SVMKRSEEF 
SLKNA 


862 


14763 


A 


869 


345 


1 


KQAGLKNVKITG/EGASVNQSR*VS/YL 
GTIKKITEEKGYLPERVPNAGASAVF*G 
KKIiPQRTFISK/EEKQAP/GSEVGKDRIi 

TLLFCAISTMSRFMl STALLYKAADLQSLK 

GKDKHRI. 


863 


14764 


A 


870 


340 


45 


FCS FCRDRVSLCCLG WSRAPGLK* SS CI* 
SLPKC*DHRC/AA/VPAWFQRCILEKKI7 
QI Y* CAERIIi* SERPHTRHLDS STVNI Ii 
PCPLSKVI FWKKKNQI Y 


864 


14765 


A 


871 


209 


72 


KLWDIHTMEYYYSArKRNKPLKYTTWMD 
LKDIMLGEKANIiKRSHTV 


865 


14766 


A 


872 


3 


353 


SFFLGPPPFFFFFFPPPQPLPIFFWRFF 
FCFSPPKKPPP/PFPL*GGSPPPPF*RG 
FPFF*KKKGGVFPPPFFFFFSPKVSPPP 
PEOCGGGG/PPPPFFP 


866 


14767 


A 


873 


3 


267 


DFTMIiARLVSNS / WPG WAHTCDPSTLG 
G*GGWITRSGVRDQPGQHGEI 


867 


14768 


A 


874 


1 


346 


PRRPFFFFFFFLIVFIYFKCVYLF*DKV 
LVCHPGWSAVPHHGSLQP^PLRIKQSSH 
LSLS S S / WDHRHAWLVFVS FYRDEVS P C 
IiANFCIFL*RQDFSMIiPRQ/VLKAICLP 
WPPSL 


868 


14769 


A 


875 


1 


172 


KLLS SGS PPASASQSAGITGVSHRARPG 
IiLiNFFC*CAFSVPGPCLGYPDTPTH/LC 
PPSFHQSVIASLDFSCLS*L*MSP 


869 


14770 


A 


876 


1 


348 


LLFCFNNYSFFHGVKPKIPAFFVFGPGC 
LPHFFPPITPAPFFPLDRVLLFPPGWRE 
MGPF*APPTFSPQG*GVFPPPPPQ*LGP 
RGPPPQPGVFFFFCIFGRDSA/IAILPR 

Ij 


870 


14771 


A 


877 


1 


203 


GFHYAGQASLEVLTS * STCLSLPKSWDY 
RRGPP / * PGLS YFLYPSLRS * FI CAMS I 
HIPFIKKKKASD 


871 


14772 


A 


878 


343 


102 


EWEDCPSPGGRGCSEPRSHHCTPAWVRE 
TLS P KKYLFLQP * KLKNTKI YLS 1 S I P * 
EIiET\lLKILHFKMMNPLHSYNFFF 


872 


14773 


A 


879 


2 


359 


RDITKGDMQMETICHM*SFSTSIiTSGI \H 
NS PIRPS KMKNIDSTRYWRGCRAIRAL I 
HCWWRC*MVQPFGB* ISSFSKKLNMYLS 
YDERPTFRYEK* K/ P * VHPKI CV*MFMA 
AFFLISPNW 


873 


14774 


A 


880 


205 


1 


PFRGVTEGIi*EPPYVESVI/AGGTTARR 
PLFFFFFFFFFFLRwSPALVAQAGVQWR 
DLGSPRPPPPGPK 


874 


14775 


A 


881 


150 


2 


CRARVDGVPWRNPGSLKPPSP \GSSDPP 
TSASQECGITG2iHHHTRIjT*VF 


875 


14776 


A 


882 


345 


1 


KGNQPWKTEKRALFC/TLKKKKIFKNTSP 
PRGNP*KFPVKKKIPLKKEKGFSPTWGK 
KKEFFPKKKKKKKKK* * AKDLNRNPSRE 
DVQMAKKHMKRCSTSIiI IREM* I KTTIR 
YHF 


876 


14777 


A 


883 


2 


355 


DRLLFSASHLDLGTLYLLFGA* SGVLGT 
SF\SLLIRAELGQ\SGCLL*NDHIYNGI 
ETAHAFVMDLFIVRPIIIGGFGN*LGFP 
NKKGADMAFPRINNISF*LIiPPSIiLLLIi 
ASAIVSA 


877 


14778 


A 


884 


262 


2 


PTCEQSEREIKKTIQFTTESK* XKPVGX 
NLAQEAKDLYTETIKYKTLLKEIKDTNK 
*KNIIYV\SWTRRFNNFKMLGWAQWLMP 
VIPA 


878 


14779 


A 


885 


16 


318 


ILRADCADLFFFFFLGGKKGFCFPRLKG 
RGETLiLN*TLIFR\VKGNF*PKLPKlW/ 
DFKGWPYHL/AENFVPFLKKGGLTLLPG 
LVLNS* IKEPFHLWPPKGVA 


879 


14780 


A 


886 


298 


322 


KRRTP* yPPGEPHFPPPPPQG* KARGAP 
PPPPKKGFF\PEKKKKPGVMVQGTTQKP 
PKKTKTOVD 


880 


14781 


A 


887 


353 


3 


FPFWGFLKTLFGKAIiWLFKGFFFFAHQK 
PFFLPSLPPPP\LGFPLRGLT*PKIFSP 
LF*KGAPQKGISHPLFFFLISPKPPPPF 
FPFFFFPPPPFFFFFPFFFFS \RDRVLL» 
YCPGWGA 


881 


14782 


A 


888 


22 


341 


IPCTCIiKIiHGKVHDHDQSQLWWPKPQEE 
MRTLENNLAVLQS I LQWLD I ELP YDRAI 
I CARE I KTYV\QQNCT * T FMLALCI TAK 
KW/KQLKGPSTDEWVSRMWYTCTR 


882 


14783 


A 


889 


51 


338 


ERS QLQWLMP 1 1 PSLFFFFETQFLFCCP 
GGRAWAIFKIFEFIiA\PK*KQFSCLTLQ 
T I * VYGLNP P PRKNF VFL KERGLiFHVGQ 
SGRDIiPPSGDPP 


883 


14784 


A 


890 


1 


225 


GRLRPENHLNPGGRGCGELRLCYCTPAW 
VTERDT I S /RHTHT *NYF Y * GS I *QLAN 
SCCCNMLNKFNVI CILRWC 


884 


14785 


A 


891 


357 


164 


GKGCSDSRLCHCTPAWAT\TKTLSQKKIi 
ILKKQNKFRS*LDD* INMRCTVWPCFHV 
FLF IRAAPIjFSDWLYNK*iyiNRNT 


885 


14786


A 


892 


207 


302 


EPFSGIIINESIHQEGIIVLNVYTSSNR 
PSKYMKQTLIELKGEKVKSTIIVGD/FY 
THLLVIDRTSR*KR 


886 


14787 


A 


893 


86 


332 


VMRVSCCCLKD* ISLCHPTOSAWQS*L 
TVASNSW\VKQSSFLGLPALWEAELGGS 
LEVRSLRPVWAT* TP \ FCKNKKL 


887 


14788 


A 


894 


2 


336 


FFFWPPPQTFLKNPGGPGPKREEGPGGS 
PRGQKQGPPGLHLTGFGGQPGG*KNPEP 
GPGGRAPKGETRGEGGGPTRPQ I P / QL I 
NGKSGKPPKVTLNGAWGTIKI FL I KTP V 
G 


888 


14789 


A 


895 


1 


214 


ARESLASFIiPPSLPPFULGLRPLPSFLP 
FYFSFLPSFLGFGSPLPST*VPSFLPSP 
LIiPFPSFLPWLWFLPS/FLPSFLPSFLP 
SW 


889 


14790 


A 


896 


131 


352 


TLHESDSESVPRDFKISDALAVEDDQRS 
PGTLNAAELS/SSVRERKKKEK/KPEPG 
li* DQS I KESDSYMVSGGRIQ 


890 


14791 


A 


897 


3 


184 


CSVAPAGVQCHDLTSLQSPPP \SSSNPP 
TSSNPPTSAF*VAGTTGMCHHAWLr PVF 
LVDAA 


891


14792 


A 


898 


327 


264 


NRAVSLMNLDAKVIj \klisasqi yvyik 
KMIH/HNQIRFTiyiRM*GQFII*TAINIY 
HKL I K* GRKHMI PL I YAVKI FEKI* YHN 
I MI KTLHKL * I KE PPGS FIYSL 


892 


14793 


A 


899 


3 


313 


TKAASHSQ\RANIiQFLVGRIH*HLKSRT 
ISPGRVGATAAVNSTAILEYLTAEVLEL 

ARNASKDFTVKHTTPHYLQIiAlRGDKEL 
GSLFKSTIAGKGVIPHIHKS 


893 


14794 


A 


900 


340 


1 


HLCPHKEHLY/ liLQTSCPLAI I * FIST I 
AEHTKAPFDMAKVESELVSGFNIEYAAG 
PFALF F t AE YPNI 1 1 INTLT VT I FLGPT 
YVALSPEIiNTTYFVPRKI VWRVFII IM 
FV 


894 


14795 


A 


901 


1 


252 


LTFFPQHLLGLSGMPRRYSDYPDAYAT* 
NILSSVG/SSPSLPAVILIIFMI*EAPA 
SKRKVLIVEDPSINLrE*LYGCPPSPPSF 


895 


14796 


A 


902 


326 


1 


LGPPPPLKEPPGVFPIFPF\*FWNFGPV 
GPF*KII,IGPLIIiDFPQPFFGGVSQIVF 
PPF*GKIPFP*KNTPP.FP*GIGGSIPHP 
PLFFFFFFL*DGVLLCHLGWSAVA 


896 


14797 


A 


903 


1 


352 


KGIFPVWSCG* KGTQKKGGLSQGGQGNV 
P PCGIHFVKE FLGFPGAQLL IS* GGRL I 
F*KGPKKGFLP*TVFG/RRAQFEAPR 


897 


14798 


A 


904 


325 


2 


RliEEKLWDYWWEMKVKKNTEYPLNLPVE 
DIQKRPDQTWVQCDACLKCRK* PYGMDQ 
HLEK\WYWCNNHDSQFRYCMVPKDPEDM 
DLVHPNYGKPYKKPSKETYWFSQMP 


898 


14799 


A 


905 


1 


329 


I GLAI HGAER I VRGQTAKS LAVHN\ VCE 
QKKI KKKKGGPGRE FQNLKWENPKNPGG 
SF*RGLGPQSNFFYLKQ/RLGFFF*KKP 
P KI * IIiAMEP PNKKI F * KNKNKAHFFF 


899 


14800 


A 


906 


148 


2 


DVDKLFLLRSLPTL * RPE YGS YMI SGT \ 
TGQPYGGTMSEFNTLEANMRIR 


900 


14801 


A 


907 


384 


1 


ESKRS I FGPPNP PP PGFKGFSRHAPLWIi 
RSREPPHKAGVKKGFPRFRPGS*MPIP* 
SFLQNCLPQRVGFPSLPPPGAPFLiKISL 
FliRQESRSVTQAGV* \YGLLQCRPPRLK 
SFSCLSLPNSWDYNR 


901 


14802 


A 


908 


3 


297 


TKIKSL* INHLIRAKTVKLLEENMGINIi 
HDLSSGRQQPFRYDTFQ/SMINTAIKEK 
IMKLS PVKMGS FI SQRTPLRKERQLTK* 
EKIFADVYLIIGRRG 


902 


14803 


A 


909 


169 


373 


ASNILSATDISNTF6PPGSQGFS6REAY 
VEAGTYYTNFSCLGQVKVF*YWMQLIVS 
I VLL * LL* VTQKL * L/GPRQEKFV* YVP 
AST*ASLPLKPCDPGGPECVFEMSVADS/ 
DV*SLWLAL1GESKFRCLVF*SKFLPSS 
AYTYSPLEKKLFILLGCFVMICFCFCFL 
RRVL 


903 


14804 


A 


910 


132 


368 


GRIFLFVGQEKGARVSPLFLF*DRVSLC 
HPGWSAVAQSQLTTS /TWTQNNS SHLTP 
P * VAENHRCA/PHTPNVLFLC 


904 


14805 


A 


911 


2 


339 


]SrSWAQE*AGlTGSCHHTQLTLEF*AHDR 
QMMFFLFFIiKTNFTFCPPA*RPWPGLGL 
TE\PLLPGLKHFF/ CLTLPSNWDKGHLP 
P/HPYQI/CGFLRKNGASIilSG 


905 


14806 


A 


912 


3 


381 


LHFCDTHPLiTPRPVS IQRQCLPLVEAGI 
RWRAL I S PQLHP PT / YSLLS S WD YKHAA 
PHPANFFFFFFKKGSIiTMFPRPGGS* IF 
lYLL I rLJERES Y/ AVAQAGGQGPNFNS6 
QPLSPRFKQFSYLSLLR 


906 


14807 


A 


913 


394 


1 


MSKGKIG*KLG*KFAQKVSQPVNAKEKL 
LKE I KHFTLMNT * M I * KRNS / L I ADL E K 
ILWWIENKTGYS I P*TQNRIQSKALIL 
FYSMMTERGEE A\ KKFEARRDYFMRFKE 
RSRLQKTMKV* DE AANADGEAA 


907 


14808 


A 


914 


326 


200 


HNWLS I WKKVKSKTCFtiMPHTKINSRWI K 
DIK1/R/LHRIKILQENVGNHIYKI*RQ 
KRS * PRHNFLHKNCKCLVWHKTNK^NH* 
KI YL * CX * KTFAVFM 


908 


14809 


A 


915 


3 


334 


L WRVKGP KFKFWLHHQLGL YC * VI S FF 
INKMKE YVL I HRI K* DHKN IF* MGYKKL 
*lSfW I W * LVLFGFS S KNHL / WSGAVAHT 
FNP S TLGDRGRR I TQGQE FQ I S LTNMVK 


909 


14810 


A 


916 


14 


299 


YQKLPQKKSPEPNAFTDKFYKIFK*LTA 
ILQ* LFQKI /E/KETPLPK* SNESSNAL 
I*KPDKDIT/SKNYKPISVSNIHAKIIiN 
KI *GYQVQQKQLYSK 


910


14811


A 


917 


3 


339 


SLQPQSCRLKCSSCLiSLSSCWDYKHKPP 
CLA/NFF/VFF*EKGICTVKN*MGDYPL 
GRRVTKRDHERGF*NAGKVLLVGW/CGF 
ETQSRFVPQAGGQGRNLGSLQALFPGLM 
PFSC 


911 


14812 


A 


918 


2 


321 


GIISGIYKEBSQPKMTDSSIKK\KDLNR 
NFTKEYVQMACKHMKRCSASLVXREIK\ 
IKST{yiR*H* \TPTRMTKIKD\DKNTKCW 
* GYGAVGMLIHCQ W XNTKMVQPLWKN 


912 


14813 


A 


919 


51 


326 


FFVFCLFSLVFISLNCGLF/PTFFRIPS 
L/PYLQLI * IYIiFFY*LKPI FLKFFRDR 
VSLCHPGWSTVA* S *IjIEASKYW\VQ * S 
SHLSLLSS*DY 


913 


14814 


A 


920 


239 


2 


DSLIIiSAVQ*HDLGSLQPPHPGIjKQ/FIj 
PSNWHYRCMSPCLAYFLLFLVETPFCHV 
TQASLELLGLSNIiPASASQSAGI 


914 


14815 


A 


921 


1 


242 


PRPRRRLIiKYFILFNFLRQ/RSRSATQA 
EVQ*HDHSSL*PQTPRIiK\HPPASET 


915 


14816 


A 


922 


249 


1 


ALFCALKKTLCGTLMF lAALF II AE \SR 
HNQTQPTCIaSTDEE INKMWHRHAMGNYS 
AIKRNEVLIYAKI*MYr.ENIMIiSEII 


916 


14817 


A 


923 


404 


1 


SRPQKKLGELKCS*RPKGLNKKNQRKPG 
TFPFPETKFHSFCQAQVQWVYGC*SHPY 
LtiHIIFP* *VP*MFI»XriITV/PSRNKDL 
IGNSBKQIHN/WSSLYFYF*LFFLRWSL 
TliVPQSGVRWRJSUiGSLQPPIiPRFK 


917 


14818 


A 


924 


6 


358 


FFCSLLGHGGETPNWS PLPHPWL * ETKS 
PPKPPHPIiKKKKHSLFFKIFILKKKKNG 
PPPFPPAGPNPRG*NPPPPQ/RPPKGPP 
PPKEKINPPPPPQGKNFPKGKRPPLGPF 

PKKGGGY 


918 


14819 


A 


925 


268 


2 


HMYAFVKIDLTAHLRFSAGKKE I PAGKL 
YF/L * RKNPKNQ I GNQKKKKCLPRAGRM 
VAHAWNPSNLGGQGAGTT*SQALETSMG 
NMGKP 


919 


14820 


A 


926 


74 


338 


IAGITGVSH*ESKNSY*KQFL/WPGTVA 
HSCNSSNLGGQGEWIT*GQEFETSLANM 
VKP 


920 


14821 


A 


927 


339 


1 


LRIiFACPPLPKWWDYRGEPPRQIFFSWH 
FFKKQGFICPIEKGRELSNCCFQKEVLI 
*KKVILKI*PFFFFFUy[ESCSVA*AGVQ 
WHDLQPLPP\NSWDYRCPPSRPAKFCIF 


921 


14822 


A 


928 


3 


416 


RCWWENKIN/ PFWK/S/VWPCVTKVNIH 
SPYDLTISLQYLPYRDEYLCSYKNLMFL 
XVLFI XTKNYKQPKCAS TO* WLRILWYL 
/YMENYSAIKMNK 


922 


14823 


A 


929 


344 


69 


YKRS PLL \KSA*DGPANWKSLPRS * YKR 
SPIiL IKAKPDQI I SGWEAS PL 1 1 \PRAL 
LQKLKPAASKF*W/]SrPKKKRALIS/KAI 
PSKKNKAEGIMFPKFKV*YRDTATKTA* 
WWYKNSSSL 


923 


14824 


A 


930 


378 


1 


hgkkidprfipytki/nlkwfidlnvkp 
ktikfle*nige/tifvidhkk*np*kk 
klvn/wdfiki ks fcfvh* kpavgk* kr 
/qp * trrnyvqthtsdrgl i srickelf 
* lns kvins i rkwaqsln 


924 


14825 


A 


931 


479 


83 


SRKGIiGRGLKRGGFFPP PPPKGLPGGVA 
PPGFEGFFGSL*KG/DRG*PFLKGVKGE 
KGKRI*EGG/EKGGGFLPPPPPQKGFSK 
NFLGFYAGKKENP*GGSPPPPLFGPAQK 
I S LGKGENKVP P P PGNL FY L F 1 F 


925 


14826 


A 


932 


36 


440 


TTYASLDEAQLPRAKFNAFL,TTF*HI IK 
NGPILGKIGRKYMIADQ\GHRMKNHHCK 
LTQVLNTHYVGPKRILLTGTPLQNKLPE 
LWALLNFLLPTIFKSCSTFEQWFNAPFA 
MTGERVHLYEEET I LI IRRVHKVL 


926 


14827 


A 


933 


8 


398 


CLQGKEETLTTNWHLRYFLPFFFFFWPK 
KGARICNGGRP PGGS PKNQRAPCPKP PEN 
PGEKTPQKRGFFPPKPCPGPGPGPTGGQ 
GKPP/HPQRPQILPFVPQAGPQGRKGG* 
WNPPPPG*KGFPAPTPPRM 


927 


14828 


A 


934 


1 


441 


TRSHPRALKEVYTE INIVFMPANATS IL 
QPMD * G Vr LTLKAY YLRS TFCKAI AAVP 
SDS/SSDGSGQSQLKTFLKGFLILDAI* 
NIGDSWEEVKISTLTEMWKKLNPIFNND 
FEGFKTSVEEVTADWKITEEVQVESED 
GTEFLQSH 


928 


14829 


A 


935 


3 


383 


PGFKASKDRLTFSSGNNAAGNFKLKPVL 
lYHSENPRVLKNYAKSILSVLYKWNNKA 
WMTAHLFTAWFTE \ P YVETYCS \ EKNI F 
FKI LLL I DS VPSHPRALRE I * KQMT IVY 
MP/STTTS ILQPMNQG 


929 


14830 


A 


936 


417 


1 


CFFSRDEVSPCGPGFSPSPD/HHDPPPP 
PSQCWDYRRDPLWVPPHICFLIHKKKRS 
SHMGSSMYDP*KPPHKWMKSPPPVSVLY 
6SIPVQVQIAPPPETNPVYFFSPPPPFW 
GGGRVCVCVCVSSEVCDFFGGDENVIV 


930 


14831 


A 


937 


412 


3 


KPTRVKKINPPFPKNTKNKGGRVGGSLY 
SPFFGGLGQKNGFTPEAKGS INLKFPTA 
PKSWGTQQKSLFKKKKKKKSILKFIWNQ 
KINPKQIKQS*GITLPE/FKLYYKEATV 
TKTAWYRYKNRP IDQ WNGTDRNKATH 


931 


14832 


A 


938 


1 


416 


KNRHIDQCSGIERPEIDSHKYSQLIFDK 
GAKVI**RNDNLFNKWYFNNWiyiSTC\RK 
INLDTDLT YLFQNGS QT* P *NI KLLENS 
MGGNLGDL/GMSQ*VSSSSSSRRIHDKK 
LVS/WDFIKIENICSEKGIIKRMKSQTT 
D 


932 


14833 


A 


939 


4 


443 


DFDYNHDHDYAELGTRPGSVGQGSPDPQ 
FTP SRMGREGEGTHS IL * CS SLGMG VIA 
DLSTDPTELEKRALEVAGPDGQASAI S P 
ASPRRKAGDGGHRRALPGCTSLTGETTG 
KSGEAGQDGKPPGD/GPIGPYSLPGSGP 
GSGESMMG 


933 


14834


A 


940 


3 


404 


LFMPIiEFFF*KPRSC*VSQA6MQGCYLK 
ALQAPPPRFTP/SLIjSS*DYWSLPPPLA 
NFLYF * ETRGFTGLTRMIS 1 PQPTEMPG 
IiASQTAKLIFF*K/HRVLVES /HG*SAR 
AVHRGDLiHILEP*TPGLK*YS\CISL 


934 


14835 


A 


941 


397 


1 


fvpnsqvanakkkflkeaesttpvlsg i 
rkontfaadme * v* mvwikdqps ys i pl 
sqs viq / sraltlfdctkaernrrgklq 
wepavXegskgwlmrfkershlhnikvq 
deavs ypedldkmdavntkhq i 


935 


14836 


A 


942 


441 


29 


svtlckhtvhvptflrrgkrcplfgqcq 

PWPSPG/PHPLSiyiSPV*PRPQ\PrPLHS 

applalgpfipmlsmaglpfpfqgsglr 
nwkppfpqpsllq* *hfpkplkpirglr 
pfppstlsiplpgtrpraklgvmrsrl 


936 


14837 


A 


943 


2 


397 


ardaapepgehllqglrsarhglrpprds 
rpgpd\pp\spphplplpavptfvfsfg 

DS WLL * S P FF PAP WAS EGGACAPNPHAL 
VPS P S GQASGLRGGAPS WKAGLDTDGQQ 
AGRQSPGPAPPSPPIiPPSQDC 


937 


14838 


A 


944 


3 


449 


liRQVWHEGEMPNKTTLIYHYTPI * ITNI 
KNTDNTKS LWGCAE ARS FTHC / WMR I KM 
VKPLWETVQ*YLI* *KLQLPFNPAVALW 
\ S I C PRKVKT Y / CHRKNKTRS * MF I V/ A 
LFVTAQTKN 


938 


14839 


A 


945 


380 


175 


RAPAVPATQEAEVGGI»/H*TWEDRLNPG 
GEV/CSEPKWCHCPPAWATKPNCVSKKK 

KKKEKRINHQHDEK 


939 


14840 


A 


946 


21 


331 


VAPLKYGPSKETI / IQSVRQSTECKKIF 
AYYPSDTGLITR1YEEL*RLNRKEKLNS 
S VYKYAKDIiI *PPSKEDTQMANRR/H*K 
KCSTSLl IREMRFKTTMRVDAV 


940 


14841 


A 


947 


365 


1 


GGGPFWFFFFFFLGGFLKIGKIWAKKKT 
GPSFPQKKKRGPKKKKPPGFFFFFPF*K 
GFVAPGGGQWEGFGSLQPLPPGVKQFFC 
PRFLRKW/DSRFFPFPLIKSNAEQPPTL 
YAGVRMRYIK 


941 


14842 


A 


948 


357 


137 


I S AHCELRLPGSHHS PALLCS PGLSRTP 
NriKQSSLFSLPKCWDYR\LATVPGQLLIi 
F*ARHCIS IDPSLHIiNJIM 


942 


14843 


A 


949 


53 


352 


REDNHHKCIRNKMINRKWANEVNRCFSE 
EIQMVKKHEKLIASPPIRSMQIKRYY\L 
NSIA*KKEKKSD\NTKLWQGFGETKSV* 
RYI IN/ PYDPTI PLLGI Y 


943 


14844 


A 


950 


937 


3 


KVSPYKINIHK*VAFLCTNNIQAEKK/M 
NMVLFTIATNKKI/SYLVXH* IKEVKYL 
YNPCN YK\ LLKKNRDNANK* KGTPCS * I * 
R INII KM S VL PKAT YRLS AI S LKL * MQL 
FMELEKE\TILKVIWKNKTAHLAKATLT 
\QSNPGGLTLPGLYYKATVIHITW/HCY 
KNTHVDQ* /NRRESPEISLHFYTQLFSD 



SEQ ID NO: of nucleotide sequence


SEQ ID NO: of peptide sequence


Method


SEQ ID NO: in USSN 09/515,126


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence


Amino acid sequence (A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine, K=Lysine, L=Leucine, M=Methionine, N=Asparagine, P=Proline, Q=Glutamine, R=Arginine, S=Serine, T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop codon, /=possible nucleotide deletion, \=possible nucleotide insertion)














KVNRSKQ/WKNCLSNKWYWESt\rLlICRR 
RKLKLYLSPYKKCNSK*IKNLTVKLKAI 
KIVEHQKSTVLVTGLWKGFMTNTIKVNA 
TKXKViCiSr*HLrKLKNFCTPKET/T*QIC 
RQQRVWNKLFSNF 


944 


14845 


A 


951 


246 


1 


AAS TKTGVQTKTCT * MLI AALVI TVKRE 
KQSKCSSVE\E*TMWYIHTMEYYSVTKR 
NDVLIRTTWMNLENSMLSKKDSHKG 


945 


14846 


A 


952 


2 


255 


QGCGEKGTFLHCWWEyQPLWKTVW*FLK 
DPETDIPFDQAIPLLGIYPKEYQSLYYK 
DTXAKTWNQPKCQLMVDGI KKMWYIVDA 
A 


946 


14847 


A 


953 


343 


3 


GSIYTKEMEPKVNNLQK*KASCTDGL*G 
E FY* TFENH I P I LYNLFQKTE AKGTLTN 
SRDEASIIIilPKPEKCTRRKENDK/P/ 1 
SLMS IDAKLLNKILAN* IQQSRI YSRHT 
RLIQ 


947 


14848 


A 


954 


1 


349 


AQPPFIIiFILHP \ I YCIFS S * S \R*QKT 
LKTT* FQGFIFCFFFPFK/QAKSCSRIK 
GRGPIlAKGTPBLPGIiRDPPTSPT*VAG 
TKGGGPHTQLKFIFIAK*FYISF*HNGK 
FCSRRGR 


948 


14849 


A 


955 


2 


351 


GLK3SrYAKSTL.PVLYKWNN\KA*MTTHIiF 
TAMSTE*FKPTVETYCS/EKKIPFKILQ 
LMDNAPSHPRALMEMYBCEINAVFMPVNT 
ISILQPMDQGVISTFKSYYL/RNTFCKA 
IGAIDSDS 


949 


14850 


A 


956 


336 


3 


PTKENFEPDGF\TG*PyQTFKELIPPQT 
FLKTSREYPSSSSSSSSSSSSSSSSSSS 
SSPYRPl/SIiMNTD/AKILNKILTNQVR 
LHI KRNTHHDL VGF I PK\ WFJSTI RKTDQS 
NM 


950 


14851 


A 


957 


181 


1 


RWDLTMLPWLVSNS XGIiRRSSH/ liPKCW 
DYRFEPPCTA*GWFLrGPHWNVSTWGS 

PRISRQ 


951


14852 


A 


958 


3 


293 


GGLTS PHVKT Y YQATVI KTAWNG * RGVC 
MDQ/YNKTENPETDPCKYSQIiMFSEVTK 
AIQ*RKDSLVNIENWNN*iyiSIHKKSSSR 
KHLNQYLTPYTKL 


952 


14853 


A 


959 


350 


3 


NKKKGQSLVFRSPTIiFPFFFL* IEMG\F 
IMLARLV*NS*RRNMTTSGSQSVGrTGV 
SNHARPKRKLFLFS I ITTIGWAGV\WWL 
TLVISALWDYRREPPRPAVLLPFHIQRS 
RMPLSN 


953 


14854 


A 


960 


349 


1 


GGFPPPLFFFPP/ PPPWGKPPPPFF YQ \ 
SFPPKKKGGPKPPFFFGSPWGVHKKQFIi 
VKSPLPFFWKKKKPPPPF*KKFFPP\VF 
QKQPGRVKKPPYERNSWLLGPPTKKSPK 
KSTGS 


954 


14855 


A 


961 


379 


160 


PGPQGEPPFFLKNPQKLVGQGGRLF*AP 
LLRGVRQKNSLNPGGGGALKPGS /HLWP 
PSWGKKGDFLP*KKKKKKK 


955 


14856 


A 


962 


184 


2 


TFFSPKKIFPFFFFSKKFFPKGTFFSPR 
ENFFFFFFFF * DGV* L\ CHPGWS AWPS 
RLTATS 


956 


14857 


A 


963 


1 


328 


ARIiVLKS * PQ/ CDLPASASESSGI TGVS 
HCASASQSAGITGMSHHIRPKWISLYIiG 
FWSFNKNVLHFFCASSLEGESMNWELLS 
KKNWE PLDKKLHGS TKLGGNHVCVNF 


957 


14858 


A 


964 


1 


334 


HPLGGRVGGVPLGP/WLNE>PPPQ*GTP 
FFLKKPIJNPGRGAiCPVlPAPRGG*GGKF 
LLPPRGG/PSMNPNCPPAPPP 


958 


14859 


A 


965 


206 


360 


QKSMSLHQQ*QDIHSSQAHKNSSlAr\PGA 
VAHTCNPSTLGGQGMQITRSGDQD 


959 


14860 


A 


966 


10 


332 


NFFSPGAPPPPPPFGGGFFLKKGPPFP* 
KKKILGGPPPPKKNPPGVFPFSP/AKNL 
GFPPPPPPFFFF 


960 


14861 


A 


967 


1 


282 


rkc:akdlnrhstkediwisteci*hc*s 
lrkstlkprdmphiy*nd*lykknddik 
cw*gygvtgavfhfhwsrmvqpllnnw 
tvprnvkftl 


961 


14862 


A 


968 


339 


195 


FPFCRDGVSPCCPGWSRTPRPKQSTCLS 
LPKCWDyR/RL*ATVPSLDYLF 


962 


14863 


A 


969 


3 


342 


IKKGPHPQLKKPPGVPPVFQFKKSYFPP 
PPPLFNNPFFFR\DKASLCHPGWSAVML 
S YFTAAYTSWWRSSSHLSLLSS *EWRH 
TQPCLFF*P*FL/CRSK 


963 


14864 


A 


970 


384 


6 


GRKJSrVAGF PLS PLYT P PHGHGLG P PQT F 
GAGPPAHKSHQKVGRQKRGPGFVPPRPP 
AFLFFFFFKE^SVLEKKENNLYNSLFAY 
KIIiNKVQGETQCEGRAHIC\VCVCVCVC 
VS VCTCVHVCALAI C 


964 


14865 


A 


971 


427 


109 


LNSQFELQEKRISNLEDNEKII*SEKES 
KNRMARN/E/QSLRETWDTFGYTNIGIM 
ECPEGEEK/GKRKRKTIKKKMAANFPHL 
MKNINLHTOKAL*TPTRlTCTGPYGG 


965 


14866 


A 


972 


389 


145 


SLiQP*TPRLMIiSSHLSLLSSWDYRCVPP 
HIiANFLYF* *R*GFAiyiLGSSNSSA/SAS 
QSGGITGVSHHAPWFTFKKKLVADT 


966 


14867 


A 


973 


170 


419 


VS FLFFFFFLERQFYFFS PGGGEGGEFG 
L r EPFP \ PGFGNFS CLTLW/ RRWD * GAP 
PPLPAYFGFLIK\TGFPLVGRGGLDLRP 
SR 


967 


14868 


A 


974 


212 


3 


QNSMIRYSRSMSRSCLCTTILILSLNRS 
LKSPR* /WPGWAHACNPSTLGGRGGRI 
TR S ED * DHPGQHGE TP 


968 


14869 


A 


975 


28 


356 


VFETSLFRNKKEIVKGTSLiPDFRTYFKP 
L 1 1 ES I * CLHGIGQNNAWNRVGNLQ I DA 
/ECAENLIYEKGGISIQW/SLFNK*CWS 
ITIHKEEKKIRPBHCIKIKGTKDFKKLS 


969 


14870 


A 


976 


350 


1 


LPCKEKVSLLCPGRFEPPGMKKSTLLSL
PKMRGYR/RIiAPPPQLEIFFHFPLTSKA
TPIi/CLGRSLKRLNSQiyiFSPFFSBGVSL 
CSQAGIQWRDPGSLQPAIL,WFK*FSCPS 
ILSSWN 


970 


14871 


A 


977 


4 


343 


PIiHSSLGNIVRPHLLNNSNKNDTTKCW* 
GCEKPGFLIHCIWEYMMV*LLWKTDWQF 
I.IKPNIHLPyDSAVAHL6IYPR*MKTFL 
QKIi /CP * MPIAAVFIi 1 AKfTGRQIiRCPS V 
SK 


971 


14872 


A 


978 


3 


424 


KRMRRQATDWEKIFAKGISDKELIiSKIH 
KELLKLNKKKS PKQVACFKMGKQT * TAH 
LH*TIYR/CRRVTHRKDHSTRSWLLREL 

QILNT 


972 


14873 


A 


979 


114 


325 


QQQQNN* IKiCLAEDSNRDPSN/EDIQMA 
NK*RCSTSLVIMEMQVKVTVRYyYLTIiI 
S I AI I KQKKKQNPKK


973 


14874 


A 


980 


339 


2 


GPTGFKFNQGGPPPKGSFFL*KISPPS* 
NPPFFFGPQKGFKPFFPPPYLNPGKNFF 
SKKPLFFFF*KKGFFKPPPFFFFFFFF* 
GRVSLCCPGWSAV\A*SRLTVTSTSHVQ 
T 


974 


14875 


A 


981 


2 


377 


QMANKHMKSFSTSSAFMKM* IKNKMRSH 
DTC IRMANINTNDT\CW* GCRSNKI LMH 
SW/ WECKMVQLL* TMV* KFLI KY /DTYH 
DPAISIiliGS IKKKSKFCT* /T/CKILYIi 
ETVLFIMNPNWKQSNWYSMG 


975 


14876 


A 


982 


1 


457 


INEIEARNTTEKFNKTELTSSSSSSSSS 
PLVGLTKRKKSQN\NEGDITTDTIE 11/ 
TIIKDYSE*IYA/NKIEQASSSSSSSKT 
YNLSGPHHBGTESIiNRPIMSEEIESLSQ 
NLPTNRSPRADSFTGKFL*TSKDELPPT 
LiLKXFQKTEQERTIiPR 


976 


14877 


A 


983 


332 


2 


TPKAGQMQVI.FPQYGSKVARAPFSWAL 
PPPPPFFHFFCFPQPHCSLPPQIPPPFS 
RFFLQVKSSQQGPPPSLLGWGEQ/NTLP 
FFFFF*SHSIiALLPRLBCSGAISTAA 


977 


14878 


A 


984 


365 


2 


KSSKLFNYPPLFFFDKNFGPKKK/SGPF 
FCPFKNFFNPRVPGPNFNFKGFFFLKKG 
PTVF* IKKFWWGPIiPPLK* PSGVFRFFH 
FLIWDFRPRPPFFFFFFFIiVERGFHHVG 
QASLELLTS 


978 


14879 


A 


985 


1 


153 


GVQWCNYNSL*PRFPG\SGDFPTSAFQV 
AGITGVHHHTRLFFFFVGFFFNF 


979 


14880 


A 


986 


225 


1 


LYIYRERSRERDRDRDRDVSHTHLYIHT 
RTHLDNILFCHPGWNAVA* SWFTVALNS 
WV\KSSSRPSLPSSSDSS 


980 


14881 


A 


987 


2 


375 


GTDEYILIALIWMVSWVCLFFFFEKKA 
P FC P P AGKTGANFGLRAP PPPGIKEIFW 
PP*PSEEG\EPRGPNPSRGNFWFFKKRG 
GSPL*PRLFG/HPRP*GNRPP*/PPQRG 
GNNKGDPPPPPGGIFF 


981 


14882 


A 


988 


55 


335 


HIYIDVFVSGSWLLTV/ISLLELTVFCY 
NVGAL*ASG\QASElSrRISVSDFLLPSFY 
LPKAGL * S FPAFMMMGHKI L 1 P EKVI PH 
MLEEE\TCTERP 


982 


14883 


A 


989 


295 


369 


IliVTRNSEL^VLPEQRTRFLEKAMALRS 
IPLLKR/NGRPGTVAHACNPNTLGGQGG 
RrKRSGV*DQP 


983 


14884 


A 


990 


2 


240 


KKISMNLEHFMLSERSQS *KTTYYMI I F 

T*KLQWRQIYK\TERIYIYIYIYIYIYL 
FFFFFLERKFLFLFPRGEGGGPF 


984 


14885 


A 


991 


463 


0 


GFRQLSCLS\LPSSWYYRHMSPHliANPF 
\ TF* VERGFRHVGQAGIjELriTPWDSPS S 
ASQSAAIH/GMSHCTQP 


985 


14886 


A 


992 


82 


365 


E I CHQLYRS FLCSLFC/DHPSWKSH*DT 
MNCVHFI IRIiLNFS FFPPEKKFRPVPQV 
EGQGSNLG*LKVLPPRLKPFSCIiTLRRS 
WGYRGPPPPPVN 


986 


14887 


A 


993 


3 


317 


GliKQSS CLGLPKCWDYRHKPPCPTHI IF 
NTH* 1 1 KYLNVS FS FCI/ PLOWS 1 ALSD 
HVQ / PV* LYNMLVS S FLLLLLFFE I E W 

ILACLFGPVGSPILFFF*YILTF 


987 


14888 


A 


994 


350 


139 


LFRRLRWEQHLS \QGGQCCSEP*SHHCT 
PAWATE *DPVSKIQQNKNQDTFSHTFIL 

ECRTSKGVSAFIAL. 


988 


14889 


A 


995 


219 


1 


RPRRPLIQKTKAKNHHFTPTRMPVIKDN 
NKC*QGCGEV/WIT/CRMVQPLWKT/IQF 
LKMLNTELP*DRAIPLTGGKE 


989 


14890 


A 


996 


3 


498 


CLPWAAVARQPPSAHPFPQPGP*LRTLA 
PDTATAVNQALQRQES *TGC* S PSGWPA 
VPTPVAPG/PPSAG*GWDPRAWPRHNG 
PPGQHPKEKTYQPVPAP*VQLSPTRQTP 

APMACSSLLLPALPPPAPAARPPCPAPP 
FliFSRTSTLPPRRliSSTIKSKTPPGP 


990 


14891 


A 


997 


2 


419 


ADTEAAASYPDNIj \ANDES S YTKQQI FN 
VDKTALYWKKTPSRTS lAREGKS KPGFK 
TSKDKLTLLLGANAADDFKLKPMLTYHS 
KNSRAHK3SrYAKSTLCFRN*NKAWTEHLF 
lAWP/EYPKVTWTNCPEKK/VLIiLTHN 
APG 


991 


14892 


A 


998 


388 


405 


G*FFPLSASKFFFCP*ALKIiWMEGCHIR 
PPPQVRFPLQSSQA/GFISASLKGKGFP 
ASMPANIGPPKGYL*PAPPFFFFSFLRQ 
SHSVTQAGVQWRNSSLQPQPPG/LRRSS 
HLSLPGSWDH/MPPCPANFCIFCTGG 


992 


14893 


A 


999 


1 


423 


WILHD/NLRQPAQWDDQEAAPKHFPKPN 
SH/QKKVMVTVCWSAAGLIHYS FLWPGE 
TSTLEKYVQQIDEMH*KLQCLQPALINR 
RGPILPQDNARPRVTQPMLQRLNKLGCD 
VLPHPPYSPDFLPTDYHFFKHFDNFLQG 
KHF 


993 


14894 


A 


1000 


2 


406 


SDLFRAI^ETTSVIVYDVSMDYLEIDSE 
MIRDLLNTFPV\HLELQQDSSGVIQGAG 
I TEAS T INAKE IMHLLMKGNRQRTQEP T 
AANQTSSRSHAVLQVTVRQRSRVKNILQ 
EVRQGRLFMIDLAGSERASQTQN" 


994 


14895 


A 


1001 


290 


3 


VTQLPETVPHFLRDRVSFCCSGS/DHSS 
LQPQTPGLK\YPPTSAT*VGGTCHHAQIi 
I FIFLRDGVS \SVAQAGVQWEDLSLti* P 
PPPGFKRLSCLRNR 


995 


14896 


A 


1002 


143 


408 


I KKK I KRE I * KYIiD I S KNK/DTYQNIMN 
ATKAVL*GKFIGINVYIFKNRKN*NK*F 
NSTSGKLKKLEKKEQTKLKISRKETIKI 
RAKINE 


996 


14897 


A 


1003 


84 


409 


DKLiAALPSSWTLHPGSPLGVTRAPPPPP 
PPPPPPPPPRVLEPVPRSLiYPGLAVPW 
PRALH* PPHPGSLPACPAPGLLGGTRQC 
S\QTILPKKKPPPLDADPANEPPPP 


997 


14898 


A 


1004 


392 


2 


EKNAVHFDQKNLVSIWEPLQVPTQVCIQ 
NKTNSWGIAGYPFHFLKRNQRFFNAtiKV 
PGPFSK*TIHLVNIiKWLFFP*DRVSPCC 

PSWSAVVKS*FTAASTSL\IKQSSCLNH 

PSS*DCRHTPLLFFVRMRS 


998 


14899 


A 


1005 


2 


421 


F VSLLLL I TSAI VAAGAPTGRT VYP PI.S 
GNYEHPGAFVHLTIFCLHLEGVSSIIiRA 
INFI AS 1 1 S 1 1 PPAITQYQTPLFG* SGL 
ITAGLLLLSVPVIAAGITIXLIjTEPSLF 
TTFFDPGGGGDPIL YQHLF* FFGHPE VY 


999 


14900 


A 


1006 


1 


428 


DIHYG* I IRYLHADGAS I FCI CLLLHIG 
RGIiYYGSFLYSET* SMGI ILLIiATIATA 
IIGYVIjP*GQISP*GATEITYLLSAIPy 
IGTDLVQ* I*GGYSVDSPTLTRCFTCYF 
ILPVI IARLPALHL\LFLLETGSNKPIG 
LAS 


1000 


14901 


A 


1007 


1 


419 


S PIVP I VMAMACLMIiTERI ILGYIQLRP 
GPNVGGPYRLLQPFADAIKFLiSKEP\LK 
PATSTITLYISGPTIiTL.TIGLVL*TPLP 
IPNPLVULNLGIiliFIIiATSSLAGYSIL* 
SG*ASNSNYTLIGAIiRAVAQTISYEVTL 


1001 


14902 


A 


1008 


411 


27 


GQPGQHRKTPSPLS IFYF/ELAGGGDAS 
T*FQLLRRIiR*EARr,SLGI*GCSEL*LP 
PCPPAWVTK*DPVFGGRKKKERLSQLRK 
LRPREVKSIiVQHYVAHKGGNWDPNPGPM 
ATQPGRSCTTLCVPLCHR 


1002 


14903 


A 


1009 


21 


377 


MSLGGQQDS AQTTRS P QFAGF FGQRS I P 
DRTGLGFHLCVCKCVCACVQVSVCKCVH 
MCACACMCTRTPMCE CVQVCE CVQVCVR 
VSMCVQVCACVRVCACVCSGACG/ CTCV 
T*MSIiACG 


1003 


14904 


A 


1010 


1 


412 


HKANTSTYLIiTLVNTFSG^VKACPTTHK 
TAEWASTIiIEQIIPRF/GPAFISKIVK 
QVTTTLDVNWKLHTPYHPQSSGKVERAW 
S LVKQHIi I KIiALKTRQS WVTLLP FALiAW 
LWAAPQSPTGINPFELLY/SAPLPLSN 


1004 


14905 


A 


1011 


1 


410 


FRAVAGASRQENGA\TVILRDIARAREN 
IHKSIiAGSSGPGASSGTSGDHGELWRI 
AS LE VENQS LRG WQELEQ AI S KLE ARL 
NELjENS S P VHRS TAPHTQQVS P I APVE P 
PAEKP AT P T * DD VDDD I DLFGRDND 


1005 


14906 


A 


1012 


269 


32 


WISRIYKELLQLNNKKTXSISKRAQYLN 
RNFTKDI*MAKKPiyiKRCPTLLVIREMQI 
KTTMRYYFTPARMATIKKTDNNEY 


1006 


14907 


A 


1013 


21 


443 


RIRKKNSYPHYVKS IVAYTFI IRLFPTT 
IFMCLDQEVIISN*H*ATTETTQLS]:iSF 
KLDYLS I Ilil PVALVATWAI IQFSL>* YM 
\NSDPNIKQCLKDLiLMFLiMTILVLiSTAN 
NLCQIiLIG*EGA*MISFLLIS**YARAD 
A 


1007 


14908 


A 


1014 


1 


309 


SSRAAAIHGGACLWLQILEIRLRWEG*LS 
VGGQGGSKP * LHH* TIAWATERDF VSQK 
KKKKPQKANP\HLETSCKKKKKKPMIiGP 
PKSQRKEGPKAPQGRTIjWP 


1008 


14909 


A 


1015 


3 


256 


EKNQ ILKWTMDLSRHFSKK \NMQVGNRH 
MKKC*TSLI IREIQIKTTMKYHFTPVKM 
AI rWPGAGITGMSHRTQPSPTIiLLTHW 
A 


1009 


14910 


A 


1016 


397 


3 


LFFSPPPKWGFFFFPPPRFFAPPPFFFF 
PPPPFFFFPCEKPFFFFFFPPGGGAPPL 
SPPFFPPFFFFPPPPPPFFFFK/SPPPP 
PKKK*KKK*KKKKKKTPNTPKKKPKKPK 
KKKKRAAARDLELVGRVGGRV 


1010 


14911 


A 


1017 


2 


164 


EKERRE*KREE*KERK\RKEGEREGGRE 
RERERKKKRKRERKKDRCMFIVHFRE 


1011 


14912 


A 


1018 


1 


406 


ns s pp knkn * nk* kqht * * eknfanyi s 
d i el vfrt ynellqlnktgnp i kns f * k 
nqfknkgk*pnfyitqsf*rnkpi*kf/ 
akdlnkrfskedm* lankriktcltpla 
irkrqikttvkssfka/iymvwket 


1012 


14913 


A 


1019 


302 


2 


IVEPLFLGWLCLFLQNPRVKGPGPYFSK 
GKANPVVFPTV\SKQPFKKPAFPVYQKQ 
LPLiF/PALVDKKKVL/RFFFFF*DRVLL 
\ CCPGWSAMVQSWLTTTSTS 


1013 


14914 


A 


1020 


47 


382 


AKIALLHSSLGDRVSHTHIHTHKRSNEI 
ELVIKSSPKKKSSGLNGFPIEFYCSFK* 
EL I PXFLKL,FQQ\VEAVGILPNIiF * EVS 
ITLIPK/PRQEQNSNKTIYRPISLMNIG 
GK 


1014 


14915 


A 


1021 


273 


1 


QS KERRQRTQEEMGKTFHNlSfRENGTrjFG 
FS QSCLKDE 1 1 1 1 1 FETGFRS I SQAGVQ 
W/P/NHGSIiQPRFPRLKQSSCFSLPSSW 
DYMC*LPH 


1015 


14916 


A 


1022 


327 


1 


KFLKGWP lYLKKGFPFVPQGGGQWANYR 
SLVPQIYGVK\YPSAFASKINSTMGMCP 
PAWVTFGMAQLINGD* IP/IFFLRQSFA 
LVAQAGVQWHDIiRS PQPPPPGFKRPSC 


1016 


14917 


A 


1023 


3 


407 


THLTQR* /HNIQTPP / WV*TGTLSrVLM 
NFYLWM* FHFQSITLFIIiRLLTTKLTMY 
QGWLDVTQQRSYQGHHTPPVQKGLRYRI 
ILFITINTFFLEGLF*ALYHSSLFPTLH 
*RGDWT/PTGIIPLKPLEDPLLNTFE 


1017 


14918 


A 


1024 


3 


425 


LPPQKKRGFPKIPPREF*KTPPKKKKII 
FPPPGKNWPPTK/RFLKGPPPSRFIPFP 
FFPLPSSPSPP 


1018 


14919 


A 


1025 


430 


1 


GGFPPFPPKNPFFPLSP*IFG/RGVCPK 
FPPPKKRPFSKNPQRGFMNPPQKKKKIS 
SPPPVNFAPPRDLLKRPPPFFFFFFFFF 
FPPFFLFMHPSFPTSVPQWQITSPLCLL 
LILQCCVCVUilGMWGDQSESSPSHHLQ 
PRWL 


1019 


14920 


A 


1026 


463 


494 


RKTGFPRLT * HLDLDFFPFFFFWKGI F V 
APRAGGRGRNLG*IiKPPPLGWAPFSGLT 
PPKSWISr*GAPPPPPFNF/CVF*RKTGFP 
RLTR 


1020 


14921 


A 


1027 


606 


1 


AGGPGPPKWPGAFGWG/ SLNG* S * LKG 
KGSKGAPIPQSGPTVFPGGIPPKESPLR 
PPGGiyiQGRRGPRWPGASGC/VG*DSVLR 
Q P /GVTGEAPCPGAS S PQASQAARRGSM 
GKGGAFHSVWGGCVLSAAWPGWQPPGSH 
G* /PGRRGGQSGCSCPKG*AWWRPCIiAG 
SHSSAKHKGMSSEGPAPRGRAISPVSAG 
SIGPIAARGF 


1021 


14922 


A 


1028 


436 


1 


GGAPQGVLKRAAPFFFFWKQNFNFVARI 
IKGKGGVLSPRQFPPLGFKKFWGPTPL/ 
SRWGFKEGLKPPGS FWFFKKRWGPSMWP 
RWV*\IPGPKSL\PPPFSQRGGIQGGT/ 
HRPPIIPPSLP/CFFEMETHSVAQAGVR 
WRDLGSLQPL 


1022 


14923 


A 


1029 


1 


436 


PPKKIIFSPKP*IFWGGGGPNFPPPKKS 
FFSQNPPGVFFSPP*KKKKIFPPPPENW 
GPPKIF/YKRPPPPFFP 


1023 


14924 


A 


1030 


28 


366 


EDHLSPGGWGCNEL*LCHCTPA\WRQNK 
TLSPKEKDHK 


1024 


14925 


A 


1031 


3 


142 


FDCSALQEFGTRLYCPSWSQIPELKRCT 
HLSLPKCWDYR/R*ATTPGLRIVLEIjQK 
KLRR*CRELLYTPRSVTPNINDI*HlAr6G 
TFVTINEI ISIH*YIAIjA6IjRFLSSSDV 
LTSASQNVGITGMSHHTWP 


1025 


14926 


A 


1032 


7 


361 


PALRPSARGQQYYDETKNKTIiCRNAQND 
SYLPDPNPFSRFSTL/DHSWHQLEAPDA 
* KAP FGLYWNCGARVCRXQG I S AK* TEA 
CGLGTI KPS FFL r PLKQGELLGYPAYNE 
NLKKKKKK 


1026 


14927 


A 


1033 


472 


3 


TKQQMUSrVDRTAFYKKMS SRTFIANAEK 
SMFGFRASKEG* LLGANTTGNF\LKAML 
I YHS KNPRTPKWEAKS TLLVP * K*NNKA 
LVAAHIjFTA*FTEyFKPPIEIQIITADN 
APGHPR\MYKEMNVFMSAN\TS IL* PMY 
QGIISCRIPAARIEVKGA 


1027 


14928 


A 


1034 


488 


0 


PLR* IIiAQRSGIHSMKT*SGEAAAE*KS 
EARS S WFLRFKEKSHLHNI KAQDKAAST 
DGKAAESYLED\TD*DGNTK* ICNGDKT 
AFY* KEMPSRMFPAREELMPGFKASKDM 
LTLLLRANAVGDVNTjKPMMIYYTBNPRA 
LKNYDKTQLC/PVLYKW 


1028 


14929 


A 


1035 


114 


491 


NYFKNFTFSNKIaTL/CIiL*LiFLL*KLFKr 
F*TF* LLC/ 1 *HLA*NTNTLYSCTKIFS 
FFIFLFyKHFLKFIYFFYFLDFII*GAY 
* SLCYY* WYPS CQFCQVLFHVFWGLCC 
HVLVRLLYFQTDEPFY 


1029 


14930 


A 


1036 


114 


496 


LLRAILTYLKYKI SAMNLVSA* FIYLHL 
TYHCVF/DHPVQGR*LLNK* INELFCYR 
SFGF*WVFSYSHLSED*ALEEKYLRERS 
RWVK/DLNVTSETVKILEEILGKTLLDI 
GLGKEFMMKTSQANITKP 


1030 


14931


A 


1037 


381 


45 


knrgrkkns*lggp\sdkfyktf*kliq 
iiyiclfqknekegtl\*nsrpislinie 
ani qn/ ravlam 1 1 qkh i kki irhnqw 
f/ipgr*g*sntck:si\nvihhikqrri 

KSIEF 


1031


14932


A 


1038 


134 


394 


VIiFKI RSGS ITQPGVQWHDHS S PQP * TH 
SWAQTI FLLSSQWGTKGMCHHTWL/ IF 
LFLFVF 


1032 


14933 


A 


1039 


483 


30 


YDGGSEIINYVLESRLIGTEKFHKVTND 
NLLSRKYTVKGLKEGDTYEYRVSAVNIV 
GQGKPSFCTKPITCKDEL/AYVSTTIYT 
SETCTFVDL* *DINKNDLPL*LQtLAPP 














AHIISQEKEIKKIQI*YKEEKLPLFTSS 
SLFI * VKNLMEFAKKLLELINEYNKVER 
YKIWIKISriLLAKlSrTWTLKF 


1036 


14937 


A 


1043 


50 


457 


TKYQIiANKNIVIKICSTLPI IRE/MQIRPH 
EISLTPIMMALSEEQKNNKYWGGCREVE 
TLVHCWW*CKTVQSLRKTVHRFPK*LKT 
ELP*ELPPK/PGSQRDICTPMFIAALFT 
lAKRFWKHPKCPSTDE*IKEMWyiQS 


1037 


14938 


A 


1044 


2 


469 


LKQSSCLGLPKCRDYR/R*ATVPG* FIL 
*NTTEDPNLTKN* IIjNTNH/WCYSVQIiI 
LGESNSTIKTSIiSFSQASSHQNTTVQFL 
STSPSFKNCQWLAILPRK+GLTLSFCFA 
FERQGL/NSVAQAGAQWLHHGSLKAQPP 
RLKQ/FLPNCPTLMECYRLQP 


1038 


14939 


A 


1045 


42 


200 


LICT*Mi:iIVALLVIARNWrKQIRC/SST/ 
DBWLNKLW* I PME Yy*AIEKEPTIGTC 


1039 


14940 


A 


1046 


475 


36 


IJSnm* IJmP I KRQKIiAG / YVRKHDPT I C 
CLS \QQSRFIFKDTNRLNIKGWRK\TFH 
ANSNQKIA/GVAVYVSNKTM\ FTRNKE * 
NY I L I KGS IH**CIIIINI CATNS*NLK 
I *NKLTSMKEE IHCSKI TVED / YNTLF * 
TMDRQKEFHRMQ 


1040 


14941 


A 


1047 


3 


326 


FFFFI III IFSTERDS VS KI III IKNKK 
DLF * FLVTQVTPE I TNQ Y I S TLPLQS KT 
KWDRPGTVAHSCNPSTLGGRGRRI/TLR 

SGVRDQPDQHG 


1041 


14942 


A 


1048 


403 


15 


EVKNSSFSYRLMTEIIVIPFGYVSKLYNCD 
SGS*LS*SFNVCKMNRY*Y/LEERGNTS 
FKSLLLLMWKLLHSAIFTSA*TPRGL* I 
YFIYMYVLETWSHSVAQAGMQWHNQGSIi 
QPSPPGLKEFHRTDIVYAR 


1042 


14943 


A 


1049 


25 


457 


YMWD^rNSHHSCMFTLLPDS S FPPPSLCFI 
LRSLFLLYAELPLAFLLVKDSFILFV* I 
MSVFHLILKLQ*NIHNIYFTILTIPKCT 
VQWH*VYSHFSVAKTAT\ISRTCLIPSS 
WDYRHMPLCXANFFIi*RQGIALLPRLVL 
TSWLQ 


1043 


14944 


A 


1050 


14 


478 


FEPYCRH*TTRWN\CWWGYELVQSLWKA 
TEQ YVLKPD I CVF / LEPG VS LLGM YS KE 
LCTLC\YQKTRMFIATLFALVKS**LPK 
S S * MVE * I TKL * YFHTME Y YTAM/ KNE * 
ITI^TTIWMNI/TIILNERSQTQK 


1044 


14945 


A 


1051 


2 


465 


GIDQPSKPIPL/TQSFTQTKA/LTFFNS 
VKAERGEEAAE/ER/VEASSD*FMRFKQ 
RSHLHNIKVEGEAATADGETAGS \SYPE 
DPAI TDEGSHTKQQ I FS VDETACCWKI S 
SRT FAAKE KSMS VS KASKDRLTLLLRAS 
AAGDLKEFPRLV 


1045 


14946 


A 


1052 


392 


24 


DFAPRRKKRETRRS KINSLTS * * KELEN 
QVKTNQKGSRKQEITKIKVELREIETIiK 
TFPKKKVNKSRSWFYEK\ INKLDGTLAR 
QINKKRKENQ/DTIRNNKGDI ITDPME I 
*TTIRGIPPLWR 


1046 


14947 


A 


1053 


204 


488 


CAVCYTQISLTEWVDIj*SLDPGFETVSH 
LFDRVNQP*DCRRVTDMIjyiVF*NIiHFVI 
IiFFLFWF/DFVFVIiRQS FPWTQAGVQW 
RDLGSLQPIiPPG 


1047 


14948 


A 


1054 


498 


2 


SWVCLFIiSPGITSHTYVPMIFKIGAKK 
VHWWKSILFNKSCWRNLISTCIRMKVDP 
M/LTPGTKINTNWIKDLTPSAESnCLK 
ENIG/THFHDIRLGNAFWDMTPKA*ATK 
EN* IPWITSK*QT\FCAAKNTVN*VKR* 
PMD* E/RIFAKHI SEKRLI CVI YKEQLE 
IiN 


1048 


14949 


A 


1055 


28 


209 


I DGRSRGLWW\ CWLASQLLWEAEAGGS 
LEPKSLRLQ*AMIM\DCTPPW 


1049 


14950 


A 


1056 


163 


479 


TSFLGSQSAGITSVSHCPSREVPFLKLI 
HWRQGGQVALLVATPHSPCCPQYRIiAPI 
PRARHDFACASLI FVCILIiVHVLLMPRS 
VAGRGVGSGIiLG/CQAGR*L*TC 


1050 


14951 


A 


1057 


2 


1258 


ELFPWHPQPSRAASLWAKAIRDKVPGT 
RRWPVPTDGRPEGFALSEPHHSCLHGRH 
QGPERCRHMAAS P * EAGNK* PKDKDGHP 
GERTETAAGVHTGCGGKGPAAAGGR*AQ 
GGSGEAGAAAGGAGAAVGGAGAAVGGSA 
GGRWPAGLLGQHGAG* GEGP CRQHGPP P 
GVSAGQAASPAKAAGQPGPGT*GAAGQP 
GRG*GPAGPRGGAAAERAGAGAMPAQGP 
AGAAAEPAEGEARPGAGDYGPAANHPGA 
RTGAGGAEGAGAAAGGLPRPAQAHR\AQ 
IHGPVPLGTGGRSSS/GGIPDNMSHRLW 
QRHRSHGEASAGQRHPHPGPTGGER\GL 
QSMLSKIREVAQQGGLKVGLRGRALGDQ 
EEAPIQQQVFRLCPGNLWR/RPPHHMRP 
*AVLIi*ITIFHISRRREDVMDPMPSSPIC 
Pli 


1051 


14952 


A 


1058 


2 


437 


ERSVRTACCKQSRGLSSLLCPPPAPRLS 
*TGSPVTGSSALEPRSPHPQP\PSLRPR 
S PGPQPLHPLGPRS PS PS PEQTVPSHPA 
RLPSLS PERTQG\ PKAQ* SQLPPPSLAL 
AQPAPAVGGREEVASPACPSWKDKSRLR 
AVPGSA 


1052 


14953 


A 


1059 


47 


456 


TRCYGTLLRLDAPGKLWTRKiyi*VVASTY 
ARTDRKSSASFRPIiLPLPRGFQLSL,HFS 
LTSPSCIjAFSANTHRGLRGENYHITK/C 
DMAPKV/HISr*TQAWQWHDFGSPRPPPP 
GFGRSSCLGLPSGWDYSPLWHHEQIIa 


1053 


14954 


A 


1060 


484 


13 


FPTSASQNTGTTGVSQHAQPI IF/LFVE 
MGSPQV/GPASLKLPTSRDLPTPASRAA 
ESVSAHHH*FLNFL*RWASGFVAPSWS* 
TFLL\KQSSCLSLPVC*DCRCYNHWAWP 
HSVFSSQFLSLFPFNFSFLFFLSFSFFK 
DRISLCRPGWEFHQTVQWSR 


1054 


14955 


A 


1061 


1 


494 


GS PGHPVCGRRRSAIiENPGQPCSLQPGV 
VSGATGAI VRVQRTS S AWATAAAMGAYV 
E/TTRFRYT*KAGVGCDRLWGAWLKADG 
LGETKAEHTLHDG/ PPEDALYGL I EGDD 
TNFTIQGEWHCWYTGTLPDGTVFDTNI 
QTS AQKMM/NAKPLS FKV/ G I GKVSH / G 
WDE 


1055 


14956 


A 


1062 


3 


395 


sdpspcggirfdemekflenynlpkiike 
/dkils[dptfvk*igsvmkifhtvep/ps 
hkcftsefy* tfke * i ap i shklf * srg 
imgtatppvmferlqenvipk\ptekk*t 
yrpvslmninXkilnkiltyk 


1056 


14957 


A 


1063 


402 


260 


WYCPPKV/LLRFSVYSSPPEVWAVGSIK 
AELYMLRPLFPRTREVD*IF* ICPV/LK 
GSPKNKDFCP *GAKTFEYFypATLGLNL 

KI*IFRQRW 


1057 


14958 


A 


1064 


66 


379 


EFGRGCGEPKLRQCTPAWVTEEDPVSGG 
KBCRKRKEREKT*REPSVSRPESH/PPSQ 
EAFWWQAIiSRPVLPNFISGRSASGLGK 
SEAIKSLFLYIiAVCSTTEEQA 


1058 


14959 


A 


1065 


223 


437 


KSCMS CDSNKI FFLRQGLS CSLTQAGVQ 
WHYLGSLQPLPQGPK* *SHLSLLSNWDY 
RR\PPR 


1059 


14960 


A 


1066 


414 


65 


NKSPDRNEIiAS * KCTQ*THTS lARDFFA 
KTIQWGKNSLFNIWC*DNWISTSKRIKL 

DPYI I P ytk\tnskwirdlhvkaktikl 

IjEKNGPGAWCPCNPSYSGGLLEARSLR 
PAWAT 


1060


14961 


A 


1067 


418 


1 


LWVFFFSRQSL/DSVAQAGVQRCDLGSL 
QPPPPRFKRFSCLSLLRSWDYRCRPPHV 
ANFVFLVET\GFTMIiARMVLNS * PCDLP 
AAASQSAGITGVSHHARL*FCGF*YIHR 
VAQPS SQS I *DDHP I KKFCIL* QPTPFR 

s 


1061 


14962 


A 


1068


24 


440 


E V*NL YS ENYKMLLKSMRGDLNKWKS I P 
*S/WI*RLMIVKMAVLPKLIYRFNLIPI 
RIPDEFFVKSTS*L/CKFIWNCKRLRIA 
KTIF*K*RTK/SGDIiI]:jTDFKTYYKTMV 
IRTI*Y*QKDRCID*WDRIES/PEMNPY 
lY 


1062 


14963 


A 


1069 


442 


26 


LRWQVSTPSSKTPRFFFFFFLKRQTGAQ 
RHNHSSL*PRTPGLKQPSTSASGVSRTI 
GASHHTPLIFFYFFN*GKEGWGGCSCFV 
AQAGLQL\RLQMILPPQPPLKCWN*-RHK 
P P SLT YP I FLKRTP * S CFSKEKATRI P P 


1063 


14964 


A 


1070 


294 


46 


KNKLKRKERKENNKKKAKIND I KNKSML 
EKIHKDKN*FFERNNKIDKLLATN/RIT 
KKTQ I VI FMNKIRD I TTYPTD I KNVKR 


1064 


14965 


A 


1071 


141 


469 


PKKQGVQLTQNAPLPFRVGGKRGLCPKK 
KKERERDRE IISIIIIII KLPTKKTPGP 
DGFTDELYQTYK*riTSILLKVFLKKRRE 
CFL/PYSMDPAVPAIALNPPP 


1065 


14966 


A 


1072 


49 


472 


EFQIIKK*NSFFADTEKALVWIENQNSH 
IJMPLSQSIilQSKALTLPNSI^AERVGKD 
AEEKLEASRGFWPMRPKKPS P *HKVQGE 
AANLDVEATASYPEDLPEI IDEGGHGKQ 
/QIFNMDEIVFYWKMPSRTFPAREBRSM 
PG 


1066 


14967 


A 


1073 


82 


419 


LTVGFRETRSSSWVCSSSPKRALHATjRF 
TPALKF* FFFWSFSRIiFFPFLRDRVLLC 
CPGWSPTLGLK*SSCLGll.PKCWN\yRYE 
PP 


1067 


14968 


A 


1074 


1 


407 


LEDGFFKITQSDKKEKKRI \KKCK*NLQ 
EVW\DYVKHSNLQVIEIPE*EVKCLSNL 
FEEVIEANl/PHLASYIiDIQIQAVQRTQ 
RGYIARQTSPKHIWRLSKVNMQEKILN 
LPKEKHLITYKGNSIRLTAKPSPKPT 


1068 


14969 


A 


1075 


3 


396 


LHAYHIVKLSP*PLTGALSE\LLMTSGL 
TM*FHFHSITLLILSLIjTNTLTIYQ*WR 
DVTRESTYQGHHTPPVQKGLRYGI ILFI 
TSEVFLFAGFF*AFYHSSLAPTPKLGGH 
WPPTGITPLNPLEVPLLNTS 


1069 


14970 


A 


1076 


250 


47 


VAACQNAFCTFSCYWL*NLLCRK\MDLD 
ISLLP*TKV]SrSRWIAGFNVRALTIKILE 
ENIGSAFGNRHSQ 


1070 


14971 


A 


1077 


2 


393 


pltgalsnlllnsgiamrddlhsitlli 
lglltstltiyqgXwrevtqesitqghh 
tppgqeglirygtmiafitsegflytgff* 

AFYHSSLAPTTQLGGHWPPSGITPLYPV 

ES PLLNTSALLA* GVS I T 


1071 


14972 


A 


1078 


2 


500 


LHVIGVSEGEEN\GMKQNKI FEEIMGPN 
F/PNLVKYINA* / IQEMQ*TPNGIHLKK 
TVHRYIIFQLVRTNNKERILIVAREKWH 
GIFGGTMIi*MNDDFSSEFIKARKIEMMF 
LKYKRGI SS SAKHC*SRI LCVAQIS FVS 
KG/E ITTFSDKRKLRAFI ISRPAHRNAK 
GK 


1072 


14973 


A 


1079 


419 


1 


LRWS FALAAQAI VRWRNLGSLQPLP PS S 
SNS PGS WRLQ * ANTAPLRS S LGNKNE TL 
SQKKRK* KDNSEWEKI FANHASDKGLVS 
K* I KNSYNS ITKN\DI*KWTKYLNRHFS 

KKDI KMANKHMKRCSTS 


1073 


14974 


A 


1080 


468 


2 


KLPPPRGAGSSAP/PLFFPPTKKGFFSP 
PP P * KFFFS PNPL r FLGGFFP I FPPPKK 
NFFLKNPKGFFFSPP*KKKKIFFFPP*I 
FAPPKFFFKSPPPLFFFFFFFFFFFFFF 
FFFFFFFLRTRGRTRGRTRGRTRG 


1074 


14975 


A 


1081 


467 


1 


PTYKFNAIPIKIQ**FFKKIENCILKFL 
/WNLKEPQIAKTVLK*NKVEGFTrjPDFK 
\LTVTKTVWS*HKDRHRDE/WK*TESLK 
IKPYKYDQFFFFFF*FSFL/HFF*YI,TY 
SQHHKLIFNRMPKPFNKBRLFNK^CWEN 
WISICKIIKLDHYLTPYTK 


1075 


14976 


A 


1082 


387 


3 


LiKQPPPHKSFVKNKNGVSLCCPG*F*TP 
GLKRSS\HFDLPKCWDFRCEAPVLSLLN 
NI KL * L / CGRGL I KPQFP / S VK* K* * AP 
SWSAHLAETFFFFRQGLTLLPMPGVQWH 
DPGTVQPQTPGHK* SSHFSL 


1076 


14977 


A 


1083 


2 


435 


FHPPTNWGGFSPPPP*KFFFSPKALNFG 
GGVGPNFPPPKKRFFPKNPRGGFIYPPL 
KKKNFSFPPPLKLAPPRV/SFKRPPPFF 
FF 


1077 


14978 


A 


1084 


447 


47 


PSRKFPFLAPPKRGFPPPFPLKNFFFPL 
GGLFFG/RGVFQFFPPPKKGPFFKKPRG 
VF*PPPKKKKKFFPPPRENLGPPGVFLK 
RPPPFPFPFFFFPFFFFFGRRFVK 


1078 


14979 


A 


1085 


438 


2 


AKNHPKGFSPFKKFSPPFGGGGIFXRGQ 
Ii*KSFFFFFKKPKFIiV*KPPFKIFFF/l 
FFFKQRVIAL/CPPGGNKGVPQGPPP I F 
FSRVGKPLFFNPPKNWGKKNPPPTPGKI 
FPFFCSFFL*RQGLTTSHRLVLWSWA*A 
riiPLQPPK 


1079 


14980 


A 


1086 


43 


265 


IBCHKLEYIKIKNFCASNSNR/TERQPME 
WEK\MFA]SrLISDKGLIPRIHKQLL*LN/ 
TKQQNSP I * KWVKDQLQKTP YL 


1080 


14981 


A 


1087 


11 


412 


lASGLHDFFNKKKKKKKKKGGGPLKKTP 
GGPKFNRGGKKKI FS FKGGEKKT PGGFL 
EKNPFLGGGKMGQNPPKKKKPFGEKKNF 
* GERGEKKPKFPGGKKMSPS PQE /NKAP 
RDDPRPPE 


1081 


14982 


A 


1088 


2 


292 


RSLPASADSSSLVAASLAGVRDRVSTCW 
PGWS /AV/VQTL* SARLGLPKCWDYRCE 
PPCIiAEAPSFMRSGKASCTLETVWEDKH 
KYEEAERRFYEHEAT 


1082 


14983 


A 


1089 


376 


3 


HIGLYIASCKTLLKEIKGS*IDILCSWI 
ERLNTVKKAILLKLIYGFNTIAI/KILS 
GLFIETERMILKITGKCNRSQTANTILK 
PNKVGRLALPNFKTYYKATVIQTVWSWH 
KDRWYGPFCVPTQI 


1083 


14984 


A 


1090 


277 


407 


Q I KAERSHHKQI TSMRNVKHCS SENYDS 
*RKPAPGYI I IK\F*K* IiIKRKILNSGR 
LKNRPIKEE*RYAyQDISETIKVRR*EN 
DIFKVL\KEKKSCQPRLLYPLKIS* lEV 
KYFFDKSKLKBVITNRSPV*EMIiNIVLQ 
/INYDSPYEP 


1084 


14985 


A 


1091 


421 


2 


GHPGVLFKG**A*NEKSL*NCSFFNY*S 
SFSNMQTRIKNV/WPSTVAHACKPSTLG 
GRGGQIT*GQEFETSLANTVKP 


1085 


14986 


A 


1092 


369 


3 


FLKEIRSVTPVNT*ENLIAD/ IGEKVLV 
VWIENQTSHNIPL6QSLIQSKVL/TFNS 
MKTER*/EKIiQKFSASRGWFMRPKERSH 
LH/NI KVQGEAESADVEAAGVYPDPAKT 
IVKG6YTQ*QIFNVDV 


1086 


14987 


A 


1093 


71 


506 


FAEDNGIi*LHPCS CKRHDLAVFYGCTS F 

VLTFGL*PWFLTQS/FLJTPLEFS 


1087 


14988 


A 


1094 


118 


385 


SDLGKDFMAKTS KAQAAKTK \ I YTWD YI 
KLRSPFSAQ*TKQSXE*RRQSTE/WKEK 
I FAD YS SNRGL I S S RQETQTTQ WLKKIN 
KNALCTL 


1088 


14989 


A 


1095 


419 


1 


DAKI PGQMVAR/RI PRP I KKI FCPDQVW 
FIPGMQGGFHIGKSIIWAPHIQMG*KSF 
NNIQHPFMI*KKKNLSTGTTHEGDITQH 
/R * MLHEHTTNAI LNRE KLKAF PLRTG I 
RQGCLL* PLLFS lAVKT* P*AMRQEKEI 
N 


1089 


14990 


A 


1096 


34 


464 


NSSKKEKRKIPHDLGLRFLDiyiTPKT*ST 
NGMMD / KDF I KI K/ FCAS KDT INKVKRE 
*EKLQIIYL/DKRLVPRIYKDSYYPIRQ 
LKKWWNLNKHCFKEDTQIANKHKKECS 
T\S IRE*QTKTR*RSHFISTRVTKrKMS 
G/N/NKYWQGC 


1090 


14991 


A 


1097 


458 


28 


RI IKVDLKMCTHNFDSLEEMNQFFKKSR 
WPR * NR * FE * S YNH* RN* I HNLKVSKS S 
GPDAFTGSFYQTCEEELAPILQNLFQKM 
ES IRSNS FYiOmi TLIP*GDNGS /TKKE 
NYSWSFMNLDAKVLIKILAN* I*VFIK 
REFH 


1091 


14992 


A 


1098 


41 


500 


FWIRHFLSRHKQRKNR*MDFIKrKNLYA 
SKDTVKEVKR*ST* WENYFQI /RMFDTG 
IiVPRVYKEFI *LSNK\DNHI *QWGKKAV 
SRGFSKGNTQMAKKHMQRCSMFFVIRKM 
* I \KPKMRYHFTPS SMT*EKNKDNTCFK 
SDGEYGILIYYYWGI 


1092 


14993 


A 


1099 


1 


484 


RRAPFFFFFFKRSFPFWARAP/LQWAFF 
GLVQWPPPRQIPPPGIiHPF/SCPNPPKK 
WGYKGPPPTPGKIIiGF**KGGGPRLNKK 
IFISRPCDPPAPTP\QECVGIQGGTPPPR 
PFPFFEMVSCSVAQAGVQSCDLRSLQAL 
PPGFEQFSCRGMPLKWIGSHPSPHP 


1093 


14994 


A 


1100 


144 


260 


ILILDSATjGHPEPHEFNTKGIEWYLPIi 
NTMYIIQPLDQ 


1094 


14995 


A 


1101 


265 


457 


DPAAPLL.VTYTREI \*MYITKRyTAALF 
IKTKlSrWK\RPKCPSKGE* INKLWYICTM 
EYCSAIKNE 


1095 


14996 


A 


1102 


387 


40 


RLRADILRILVYNNFLQRLKKQRYLFTG 
PQYRFLFriELWLCV-CVCTOTCVCVRANF 
*IISRDKVLLCCPV\CFCFI.NPPCVCVC 
VCVCVCVRACAI INKLLVETRSCS VAQS 
GLR 


1096 


14997 


A 


1103 


465 


208 


QNMTKTFQDYRPISLMKIYARKQQYRYR 
Y*RMIiHNYQMGFTLGLRGWFNI * K/S 1 1 

IHlN**EKKIiyiIILIESEEAFDKC*QSIi 
II 


1097 


14998 


A 


1104 


57 


250 


MTCSRMQ/RNIKLLRYKSEKL*EENSTS 
RNE I T I LS EEDS FTlsrVKLEN * MKTVKQK 
KKTSKKTGVYKMAIKKQISEFKGKTSNF 
Y* EQ INT * EKKNFKENWSL 


1098 


14999 


A 


1105 


2 


378 


YVDPRQCGGILKGVLKGEFTISNEKANP 
GRGSPSSVSKD/L* CQNIKTIESKKTCL 
I QKLI KVKAEI YERESRKTMEEINSTK/ 
SQFFEKI/NKID* /PIARLTKKKKKTQV 
LKSEK\GNITTNHTEIKRV 


1099 


15000 


A 


1106 


291 


1 


VSIYVSYNLDSCKGQIRIKSCDDQYIFK 
RIiYFFEITFLKFLN*K*VLKKIRDMVLL 
C/YPGWPQTPGLKQSSCLSLPSSWDYRH 
VPLHLAPINYFLW 


1100 


15001 


A 


1107 


24 


397 


DIDHVWEFETVFYHVGQAGLKLIxTSDDP 
TALASQSAGITGVSHRTRPSSGHFNNTP 
EPPEASSLSSHPKLHKSPVT*NGAGL/Y 
GS SKLLS CVIiNGP I SLVHSTLRLRKGGP 
QGNISQISLMAPP 


1101 


15002 


A 


1108 


64 


414 


AQriTPTQPQGRAALHKGHV*RKTAPTCL 
FMAEKNQAASFFLFPFPSICINKE/HFK 
KKKKKKKKKKKKKKKTRQKKKKGPEKPR 
KQPGGGIjLISKILWAPPPGFFLTREGP 
PPIFF 


1102 


15003 


A 


1109 


62 


475 


FB PLF YLMCLLNLFPLQLPRHPFIiFIjTV 
DIiVNTWGCPIiPSSPQ*BWLLAAPHRSTP 
PPLSSGFPARRQLEPGAGARGP/HHTQA 
LHLSFFFVFLRRSL/DSYAQAGVQWRGIi 
GSLQPLPPGPV\II.SSPLSLPSLTY 


1103 


15004 


A 


1110 


76 


477 


eewkklyrwrenisnlisdkelic* in/ 
nqtfni qpiiktknptkdvwdlktftned 

TQKYTYIHQ*R\SHMKR*SP*LFI\KMQ 
MSTALRFLYPPITMDRIQIPENIKFWRG 
CGILIHSW*TRKMFHSL*KAFWPF 


1104 


15005 


A 


1111 


1105 


3 


KEERSETLAKaKLlAAGA/YN*KEERSQ 
INNLiTLHLKGGKKTNDKPSKR/QE/l*Q 
IRAEISRIRPRKKKEKNNEFVFLKINKN 
FKTHSYIKKKREISNTKIXNKRGDSTTD 
ATEMKKITRD*CEQPSSSSSSSSSSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSSS 
SSSSPPKKKIPGPDNFTGEFYQTFQEEL 
MPALCKI FQ\NVQKPEGD I /PNPFYQVN 
lYLVPEPDE/QPFVIKTLKELVIYGN/Y 
h/K* IKIIHEKLTAlSrilLNGEKLKSFPL 
GSGlR*QHLFLPLLPITVLEFLLiRIISQ 
EKVIKRiyiQIGKEKV\KILFTDNRM*WV/ 

ENP/WSQNIGKIMKFNKVSGYSNTHKS 

VAFP * TIvINEQS ANKI LK 


1105 


15006 


A 


1112 


37 


466 


DRAAE FPTENPLEL I SKAS KVSEFNVS I 
QRPIVFLCISNKQIjENEILKIQFH*/PI 
S VASKNSKYGGINLRICYL*DMHIDNK/ P 
LNKNRGGLCSWSATSVLPKLIDNFSAI S 
GNYSRFFFFGEIEKLJSrLKPIWKYK*WV 
AKTVSY 


1106 


15007 


A 


1113 


33 


436 


PGLN* * CWKNRISTCKRMKLDPNLAPYT 
KITSKWXQDINIRP VTIKI LKENRGKGL 
QH*MWQ*FF* I *PQKQE * TWRTKSDFIK 
TAQQRKQQNERQSVEWEKI /FANQISNK 
EIjLS KI KFRLLPLDNNKQLT * KWA 


1107 


15008 


A 


1114 


2 


391 


PLTHSRMAIKKETK\NYKCWQ* SGGIGN 
LGNCW* ECKMF* PLWKIVLQFLEKKKRN 
TELP*NPLIPIIiGIH/YKRNVCIQMFTE 

ALFIITK/SWKQPR 


1108 


15009 


A 


1115 


3 


458 


I RDPLE EAVCP FADLKLHAGRTTTLFRA 
VRQGPLSL*KLLLPFVQLSHVPRGGVYR 
GSQASLSCGGLHPVQAS*LLCLPTQASA 
MADAP \ P PASL P P CS S ISDCCAS SERGS 
VGMGPSEPGMGYNLLVCRLLRPLEKPS I 
RVGVSLFSRYHM 


1109 


15010 


A 


1116 


204 


13 


PKMKTDSRWILGIiFF*FLRQSL/SSVIQ 
AGVEWRNPGSLQPPPPRFKQFSLENQGI 

PPHGLWS 


1110 


15011 


A 


1117 


34 


454 


IHISWEFLTSKLYGrLTSQYNKKTNIL 
I K*MGKRFNAHFT * EGI * I VNKHMKRLP 
TS S VI REMQTKTT VGFRS VLTRE AG I KQ 
TDS/NQAWVRKWSNSTLLCCWWKHRVLQ 
PTWN\TAWQtiLIYKMIWQLHPYAQKDIY 
VN 


1111


15012 


A 


1118 


83 


451 


CFLKFFIiYRELISPFIiL*Q\PVRPSPAF 
RRKPPPWVAFFFSDPS/FSFLVFFPKGI 
VFFLGDALKKVLT* KKNFFFPGRD*VLL 
CCPGWSRTPALKHSSCLGIiSKCWNSTTR 
P 


1112 


15013 


A 


1119 


316 


441 


F I FFFFIiKTNFNFFAQ VGGHNRNLG* LK 
LPLPGLKQFSCLTL 


1113 


15014 


A 


1120 


67 


312 


RQirQWTRIES*ETDPREYSQIiIFDKGA 
NPIQWRK\LFNKWFWHNWTSTCKQIKKN 
LDIVLASFTKTSTSHRLRENLYRIHI 


1114 


15015 


A 


1121 


27 


490 


GTQLHSREKKNSPFNK* YWEN* ILTCKR 
MKLDSYFIAYTKINSKWIKDIiNVSIiLGL 
liAKIK* S PKCKI QN Y* TSRRK* KGKDLG 
Q * FLGYDTKCKATE * KNK/IiNKWDYIKL 
KNFCIA\NKTINRMKRQPTVWEEI lANI 
ISDKVLISRICKKTNR 


1115


15016 


A 


1122 


490 


48 


EHTNKKRDTLTVDIGKVLVI*TEDQTSC 
N/ IPSNQSIilQSKAIiTLFNS I * AERGEE 
ATE/EKLEASKGWFMR*KERSHLYNIKV 
QGEAASADEEATASYQHDLAEIIDENGX 
KQQIFNSDKIALHQKKM/PREKSMPHFK 
VSKDRLTLLREF 


1116 


15017 


A 


1123 


2 


259 


I YGQLI FNKGVKT IQ * RKNS VFNRCYWD 
N*ISIWKRM\GLDPyLTPHTKINSKWIK 
DLNIRTKTLICSSWRKKKRENFLKKKRK 
KL 


1117 


15018 


A 


1124 


206 


24 


CSIFFCIYLFIYLFIYLFIYLRQSL/DS 
VTQAGVQWHYFSSLQPLPPGT\6LF*RT 
SYSWDF 


1118 


15019 


A 


1125 


3 


351 


RRGRGFTILVWAQTPDLM/HPPALiASQS 
AGITGVSHCARP I CLPFKRQGF /NSVAQ 
ARVQWWDHGSLQLQPI,TLR*YyHLS\SW 
DYRRM/PAMLGFFVFYAYPGITLLPRWP 
WISGLITELP 


1119 


15020 


A 


1126 


2 


349 


SraHKDIGTLYIiI,FGA*AGVIiGTDLSIiIi 
IRAELGQPGNLLGNDHTYWIVTAHAFV 
IIFFIVIPIIIGGL\GIWILP 


1120 


15021 


A 


1127


362 


2 


RVSLHCPGWSQTPELKLS S CJjS IiPKC * D 
YRCEPPHPAYLTS F ITEDS KNFEYVTTL 
F*NNIY*TMFIEIPP*VHYIGAHIiF*DQ 
LSTFF*SF*N*FFVCLFFRQSHSVAQA6 
\VQ*CDLGS 


1121 


15022 


A 


1128 


1 


349 


I YFLTRSHS VSRAGVQWHSSGSLQP * P P 
RFKRSSPPIiSLIiSSWDHRTWG/HMPP /H 
SSSFSSSLFLRSGWGGKGVLWS 


1122 


15023 


A 


1129 


367 


24 


INCVFYPCKGVTKKPWYT/HF*RKGFI I 
INI FHGP FKLTF VCPEGGKNS ETL YP FC 
FLPLSLAIKKSRPLGVFLV*KGPLSPPIj 
QGNYGVGWVGFFFFS AFVI F I KKLVLS P 
YYKN 


1123 


15024 


A 


1130 


61 


299 


IPRVDNYCGYVREHLFSFLFPFKRSLV/ 
SVTQAGA*WRDLSLLQPLPPGFKQF\SC 
FNLLKETHC YRD VPLLLNTLCI I FKT 


1124 


15025 


A 


1131 


2 


261 


FFFFLVFSFPLLRLFLLPSPSSPPPSVF 
FFLPSSFSL/CLVSRDLLTLSQTKVQWR 
DLGSLQPLPSRFKRFSGLSLPSS*DYGR 
TPLW 


1125 


15026 


A 


1132 


2 


381 


E F YFF F * S S QME S P S I AQAG VHWRDLG / 
SIAPYASWVQVIILPQPPPPGFESFSCIi 
RFPSTWDNRHAPPRIiAKKRNKI 


1126 


15027 


A 


1133 


2 


221 


PSLiLKKLARHSGRCL* SQLLRNLRNENR 
/GVPAWVTK/GETPPSKKKKKNPPPKGP 
GAQTFETPGFGKPKGKIKV 


1127 


15028 


A 


1134 


331 


2 


RGPPPPLKEPSGVFRIFQFKNLESRGGG 
ERGKQSPPLGPPIRRGLSPKPPPPEKGL 
TVPFFFFFRCKDGVSFCCPG/WISRTPG 
LK*SSCLGLPKCWDYRHEPPHPAPAL 


1128 


15029 


A 


1135 


3 


391 


AVNTQMMRK* KSLIADVEEV* WWIEDQ 
TSHlsriPLRQSriVQSKAL/T*RAKPSMKA 
ERGKE AAKGKLE ASRG* FMRFKENN\RL 
HNIKVQGDTASADVEAAASYS *DQAKIT 
GGCGYTKQQIFNTDETAPY


1129 


15030 


A 


1136 


107 


350 


TYSIVAIVMRFYFFIIFYFLRWSLAWI 
QACCCHPGNWHDGSLQPLPPGFHKFSCIj 
SLLSS*GYRCPPPCQA\lFIFriVEMG 


1130 


15031 


A 


1137 


35 


297 


TLMHYTORVKEKNI X IS lEAEKAFDRIQ 
HPHM/ IKYLNTIKAVYNRPTAS I ILSG* 
NLKAFPLKS VTQQGCPLS PLLFNI I LEV 
LSTRP 


1131 


15032 


A 


1138 


378 


1 


PFTKINLKWIIDLNIKHKTIKLLEDIIG 
ENLDNLGHDYDIIjDPTPKA*AMKKIIiIN 
VAL C * NHES FCS ENDTVIRMRK*ATDRK 
KI FAGDM IKDCYSKC/NE KJ\E VNNMKKN 
NPIYKWAKDLNMLL 


1132 


15033 


A 


1139 


271 


361 


KW/YWNPIHIISQVCL*GPEXYQHVYGQ 
MTKQRCQGNSM/WAKDSLPNKWC*NKWI 
PF/C/RI*NLNSYIiTPYRKIK*KWIRDQ 
KTITVTTKXiLEENMRFSFAFGIGKD 


1133 


15034 


A 


1140 


28 


340 


LLYF*YKRGFTiyiIjSRLVSNSRPQ/CDPP 
TSASHTAGIADGSHHARLLFFIiEK\EPA 
PGPPAGRKGANPG* REPS PP6FRGXPP / 
LPPPGNWDXYGGPPPPKPNFGFF 


1134 


15035 


A 


1141 


94 


128 


GSQMPRHriVD*MTRHrjATLRES \CySR\ 
VYPRFIEPLHFDIQSTGQKSHR 


1135 


15036 


A 


1142 


85 


492 


VWVGLLSLEGSPSKFGNFIEFGVIiLSSG 
GFSAWRLFFFVYFLRQSL /NS VAQAGVQ 
QWRDLSSLQPLPPGPK*VLKQRGVCLFV 
CFETESHSIAQAGTQWCDLGSLQPLSPE 
FKRFSCLRNSRSLRRDIEPSEGNQC 


1136 


15037 


A 


1143 


372 


3 


KMNR* P I S I KE I DF I VKNL PK \ KFGPDG 
FTGILY\RHFKKEIIQTVCNLFQKTEKE 
GML S \ I YKAS I AQ I P KPE KDKQTKATD / 
YKPSGIEAKIINRIIiANGIQQYl\HDQV 
RFTPGMQGWSNI Q I 


1137 


15038 


A 


1144 


56 


482 


TMKTLLYWQKNG*VDQWNGIESPEIDPY 
K\YIQLVFDRKAKTCNV/RKDSLVNKWC 
T YAKKKKTDNL FFTP FTK I TS KT YLKNW 
N/SVNFLNTPLGKTLGDLGFPRDFKFLN 
TPKAKSLEKH/MDTIiNFFKGKI CS*KNiyi 
VKKNKKT 


1138 


15039 


A 


1145 


190 


482 


RQGLALSPRLECSGVIIAYRSLKLVGSS 
DPPTLASQIAGITGIRHCPWPKTLFLAS 
VIMPAHNS*PSLSPVPSP\SIiPSPPLLA 

SQSRRSQ 


1139 


15040 


A 


1146 


138 


485 


IFFPSV*TIFLLLALFFS**MYLIFVKS 
SLSIYSVYQF/SFLGFWSPISDPKSQRF 
SLLSFIVCFFVFPRNRVLLCHPGWSAW 
*S*LTAVSNS\LLKQSS 


1140 


15041 


A 


1147 


101 


426 


GD WKKFYI YKHS ES KS PL ILFEKKKGVL 
EEYSSFDI**AIKVIYHISRKRK\KNQL 
ANLTDAEKALNKI *HPFMRKTTQQFSNR 
SFLHLRKGI YKKPTANMIIKEQI IF 


1141 


15042 


A 


1148 


458 


15 


VFVCLKI CKHRKGTAKKWYYNFMGLPLY 
MRSWDQNGTM* KMTI YAPNIGAPKYIE 
QILMGANREVNSNTILLGDFSIPLSTL/ 
DKELPEIiNFT*NKTDI,TDIYRTFHPMAA 
KYTYFAGTHGTFSKMCPVGFDIRVILiVE 
FRRSTYGXiDW 


1142 


15043 


A 


1149 


1 


507 


KGPPAPPPYYKKKFQGGPKKFPP*SHPF 
KRPRGEDP/YKPKNLKPPWAKKKKTPPFP 
KKKQRGKGPKNPPPWKVKPEKSF*PSPK 
KTKIWPPPPPGGQKKKKKPPPPKKKKKK 
KS VSS S PKNI / YRS IiFCTNLKKS FPT* F 
*NIjMP/TKXiPV*EFPPPPYNSF*SAP 


1143 


15044 


A 


1150 


436 


32 


nlcmyvymyiiiyvrtyvciysi yvs i yl 
idlslyhhhylsicmcvsiypiylsiily 
lcisliyhlfnlciyisiyliyls/yvc 

M/SSI*SMYVCIYLIYVCIFS1SLYYIjY 
lYLiSI ITI lYLYLCMYCLFYLSSIEF 


1144 


15045 


A 


1151 


3 


657 


QDCKIQNEHKIQNTLFLYTRNAHIKIKF 
HLHSKKI]:iAWLTKHV*NFYAEl)TYTTLM 
KEKI * IERNKDLHRKQ\KLNPCIWIVKL 
NI VKI P VI \ PHR*RSI*TIPI 1 1 PAK\FP 
GDKEKIILKFIWKGKGIR*LRF*KrRIK 
GGM]:»PN\LRLTWTVIKIV*yWWKERD 
RYMQXWNKIENS /EKL.KPRKYVQLISP 
VFDTSG 


1145 


15046 


A 


1152 


430 


43S 


TFWVKKFFLLNLPK* INPGFKKKIOSfKRG 
G* KRP / S6SQVLKKLRGKIALTPEGKGK 
IRDCFCPPPPPLRKKKKPPIRP* KKKKK 
KRKRDPPFGNHYSNS YRHTS CYFTLLHL 
TLQI FFTSRSPVAMS CRINL 


1146 


15047 


A


1153 


104 


478 


EIiLLEGSPCLiRAESKAGPAGRLiVPTLiLS 
WRVQSPGVLCAWEEECEEPPSTGEDAPS 
LAIRETQIKFITRWHIiSPLSMAIFQTNN 
DN\ RTKRK* CWHGCGE IG/T/LLHCWWE 
/CKLTFP 


1147 


15048 


A 


1154 


171 


476 


NSSDYYYYFLRQSFTLVPQAGVQWHDLG 
SLQVTSDDPDLR* FARLSLPKCWDYRRE 
P/RiyiSDSNYFLK*VPAm*KARA*LLFIP 
SSCQAG/ASAGQTLSVS 


1148 


15049 


A 


1155 


3 


562 


PD YQTARRRQAPADRCQPGPAAT I CPR* 
PKARS SSMNRSNPTVTRTAGP * PS WKTG 
AASKSQTGDGDRKEI.IPPNRTPN]Sr*VST 
AAIK*QVRPYLTQKVPAG*TFCPHQKLQ 
GVRPLCRGPP\RPFGSPAVTDNRQAKLiK 
TPTQ/HPADPPEQKRPPARPPVGRHAEV 
KQPGPPFPAGPRPAPSTDGL 


1149 


15050 


A 


1156 


67 


417 


TLSCETQPGQHGEAPS / PTKNA* lALPL 
SWARPLYFPAPARFTARGGPVFTS KRVP 
SPPNRRGWRPPP\QFCHAWNPSPKFLAP 
KIPPT/GPLPPNKEPSKENVNPG/PLCP 
L 


1150 


15051 


A 


1157 


446 


87 


PDCINVISCFLTKIYHVFFFL*RQGIjTK 
LPRLVSNS *AQAILPP/SASQSAGITGV 
SLHTRPIMYFSSYYIIFGDTVLF\ETQS 
HPAAQAGVQ*GlsrLGSLQPPPPGFKRFSC 
LGLPSRRMA 


1151 


15052 


A 


1158


1 


938 


FFFFIFATYLFNKGK\LTGICRQSD*S ZC 
KHIiKGYYIKKDILTASKH/V/KKCPTSL 
AIREMQILTGIIKCC*RCEGISrGIFTHYW 
* ARAIiAQLIiWKNV* QHILKLSMCI TYNP 
T I LFLGI SNRKVYM/ CCPK/DPCSRMSR 
AAQPVI 


1152 


15053 


A 


1159 


370 


3 


AFKAAAEGAAMSVTGGQPVLNSWWLGG 
VTTTT/CPTTFIMPDNTV\RGDILELTR 
\PLGAQVAMAVHQWIiD XPEK*NKHK/liV 
WEEDVELMYQKALMSfWQNNKKAAGI 
YTFNAHAMDEFHR 


1153 


15054 


A 


1160 


55 


471 


SPPPGLPKKI FFWPLS FYVWP* PQRFCP 
LFAPFKQESNPLKVWPPFGALQNKRSKG 
CSPSM/SARQES *PIjFPKNLETPPFG6K 
FIiKFFFFFF*DGVSLGHPGWSAI/lDSL 
QPLPPGFKQ/SLCLSLPAS 


1154 


15055 


A 


1161 


3 


467 


GYTNQQIFPVDKrAFy*KMPSRTFMARE 
KSMSGFKPSK/DRAGDPKLRSMLIYHSE 
NPRALKNYPKSTIiPVL\KNKAWMKHFCL 
QPPVETYCSGKKIPFKILLVIDNVPCHP 
GALMEMYEEINVWTATNTTS I L* P / VT 
SGVIS 


1155 


15056 


A 


1162 


216 


464 


S SEGPGS PGELPYGAQEFVMKAPQGI LV 
IR*T*FFETESHSVAEAGVQWCDLSLLQ 
LPPLRFBrQFCLSLP \S WDYRRPERHPAM 


1156 


15057 


A 


1163 


29 


433 


AVEFGGDSCSPQHPI^PLGSPQHSPPP 
LIGICT/REE*GWGGGLPVPCPPALPYP 
STPSWGGICVCYGRGGVAPPP*HPPLT* 
LGGGSTPTSALPAPGAPRHPGHPLMVtiQ 
TLPWGPHTPARKPYINKVLSCVDF 


1157 


15058 


A 


1164 


486 


11 


STCLGLPKCWDYRREPPRPALRCPFFLN 
IIiLFLKFTLSEINIPTPAFF*LVPAWYI 
FFSLFSF*P/DLEHYI*SGICVDTM*VG 
SSCFFNPT/WSISSF*IjVCLDHLYFFLF 
FFFLRQSL/DSVAQAGVQGQDLGSLQPP 
PPWFKRFSCIjSRIPAHFEWSRA 


1158


15059 


A 


1165 


467 


131 


NRSLEWAKMYKTYDEIKLVTNLPT/RKS 
PGnCKITAKFYHIYKDGIiVLLLKrXLQEI 
QVGFHPNS*YQ\JIIiIPKYSEGTTKKGN 
GRPIFLVNIETKILHEIRAI * VHKQIRT 
LE 


1159 


15060 


A 


1166 


455 


32 


EEGVLKAKRGCKSGLLHQTVNQWN/Alsr 
LEKFLKE I KSDTPVUI *MIRK*NNLTAD 
KEKVA* IQDQTSNNIPIiTQSLIQSRALT 
LFNSMKTERGEEAGEEKFEASRG * FMGF 
KEKSHPHKHKSASKAASADVEAVASYLN 
S 


1160 


15061 


A 


1167 


26 


418 


KIWDYVKQTNri*ITGXPERGGEK'™srLE 
NI FEAI VQ/ESVPS IFKEVDTQEQEIQR 
ASSSSSS 


1161 


15062 


A 


1168 


64 


313 


KWCKGNSYISKDLKELKYLGV/QLK*VQ 
/DL*SENYQILLKEIKEDLNKWKDIPCS . 
* IRRLNIKMVIPSKLIYRFNAIPIKILA 
A 


1162 


15063 


A 


1169 


464 


36 


QQAEAAESLr \ DPGGRGCSELRS CHCTPA 
WATEQS I WK* S INQS I K* KS KLC/ CWEN 
TIiVKHIFHKRLTSRIYKEL\QQLNKKTN 
NSL*K*EKDVNRYFIKKI YED J *MANEN 
IL\ IKLVIREIQINLKEWLSWFCCCCCC 
FVIiFVF 


1163 


15064 


A 


1170 


290 


487 


GSLPHHTPKR * PFLF/ CVFETGSRS VT* 
AGVQWHNHGSLQP*PPGLKRSSHLSLPV 
LIFLFSVEMGV 


1164 


15065 


A 


1171 


2 


413 


GKVFI*FSTI>H/GCNMK*SRS\LL*ALR 
FIFLFTARGLTGIVLANSSLHITLHDTY 
YMGTHFHYVLYIGAEFAI IGGFIH*LPL 
FSGYTLDQTYSKIHYTI ILIDINLTFFP 
HHSLGLSGRPRRy*DYPDAYTTRYIIiS 


1165 


15066 


A 


1172 


381 


2 


PPKLQKNFFFSSTGKFFLGGGRAPSPPP 
KKGFFSQ I PRRFPFPLPKKKKI YFCPPP 
CFGPPPTFFLRAPPFFFFFFLLLFCSFL 
VGVKFXLFIj/PYLFIFRDTV*LCYP6WS 
AWQSWLTAALTPRP 


1166 


15067 


A 


1173 


9 


405 


NLDKKGRNRTPQSWFQANPMASMTFSKK 
KKKKKKKKKGGGALKKKPWGAQKKPGKK 
KKNFFLKGG\EKKTPRGILEKKPFFGGG 
KKGP]SrPPKKKKPLREKKKF*GEKGEKKP 



*KFFFKKKFPPPSKKTP 


1167 


15068 


A 


1174 


410 


0 


FSYYPPPTRGGGPSSPPQIFLPPPLIiGV 
FFPFSPLKIFFFPRGFKFFGGWPFFSP 
PKKKVFFKNPRSVYKNPPKKEKK*SLQP 
PG*VWAPPGIFKR/PPPPFFFYIFS 


1168 


15069 


A 


1176 


319 


3 


KEARSVFRVEGRG*KNFSASGPPGGGNP 
GGGRPGRGNFVFLIKKGGPPPGPKG/SP 

FFDF/GGPPPPPPPSGGGSGVNPPPPPF 

FFFFEMESRSVSQSGVPDAWADAW 


1169 


15070 


A 


1177 


385 


1 


SASFGLPKCWDYRHEPTRPASVSVTVTS 
SRLSGA*ARSGKGTLVFWAQMVFK\RIP 
LTKYSDHSREPSSLQLCMQSTPSKAEPT 
VAKADKRLCCTAKSSTAKS lAQIKCQND 
AGRS PQESLHVSGRV 


1170 


15071 


A 


1178 


147 


808 


KLKE IKKLiLEENAGXNLYDIiRLGSGFLD 
MTPKAKQQKKENLKWDVIRMKNSCASKD 
TITFYlSr\YRSDKGLV/SiaECYKELNSITE 
RQPNF * KDLNKDFS K/EQMAKS T*KTTE 
RLFIREMKTKTT/ME/YHFLSTRMAKI * 
KDSNR/RLCRKTGTLIHCW+GTAPANLL 
KNCQPIiFPP*6TVWQFLKRL]SrLELPDr>P 
AIPPLVI CPKEMKT/C/CYTE I CTQMFT 
AA*III 


1171 


15072 


A 


1179 


1 


398 


SRSRHCTPAWVRVTLVLKKKEKSKEKEE 
KKRCPCYI VSHQSIi* KPAWQFIi/RNVK/ 
LELPYDPAI /PLIiGKCLKEI * KYAYTKT 
CM*MFIALPIIAKKYKQLKRLSTDEWIN 

KMWYIKLAI 


1172 


15073 


A 


1180 


3 


389 


GYDRVI PNP * PIiTGA\LRALIjLTCGLAM 
*FYLQSMWVLILGLLTDTLTIYHRWSDV 
ARKS TYQGHHS PPAQKGLRYGIMIiYMTS 
EVLLLGGLF*AFYHS\SLCPTPQIiGGHW 
APTGMTPLDPVEVPLLNT 


1173 


15074 


A 


1181 


354 


2 


ANSSDACILDGHRALPLQLPPCGYSISP 
AAQSSTVPMPLLLIPPPHCNRTPSLWHY 
SPASNPTNPSPY\*TAPSPPPPIPRNNP 
FS * KFP YYVWVYTS LTTCSQP P VS SPNF 
TQPSVL 


1174 


15075 


A 


1182 


3 


384 


GA3SIVSG/DI./KLKPVLIYYSSNPR/AI.K 
NYARS I LPMLCKQE KKEKKKK\AWMTAH 
LFTAWFT*YFKSTGET/CAKEKIPFKIL 
VLIDSACSHPRALMEMCKE INWFMPVN 
*HSI/LFCMQPMDQGALSTYKS 


1175 


15076 


A 


1183 


1 


410 


PPLP*KFFSPGGVQS*KGGRFWPPP*VG 
VLPSSSP/IIFFTPGNWGCFSPFSP*KF 
FFSPKGFIFCGGVGPFFPPPKKRFFSKI 
PPLVFFSPPFKKKIFFFPPPVNPGPPRV 
PFKRPPSIFFP 


1176 


15077 


A 


1184 


406 


62 


PPIINMLCSLAPPFFSPPPLRGVLPPFP 
LKNFFFP*GVYFWWGGGPHF/SPPPKKG 

FFPKFPPWVFNPPPIiRKfC/YHNFPPPGK 

FGPPRVFFKAPP 


1177 


15078 


A 


1185 


337 


383 


RS YISFQK* VKDLNKHPSKDDTQMAN / K 
HIKRYSTLVIREMKIKPMTRYHPTPTRM 
AVPF 


1178 


15079 


A 


1186 


3 


471 


IiAPSDK* E3SrRLPGNRPL*EVRSPSARQP 
PHLRSEEPIiRPAATPSGK*GASPPSSHA 
VREGGGGQPPPGQPPRPGGEGRLCPAAP 
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TGK*GAPLP/EPP?GLGG 


1179 


15080 


A 


1187 


2 


406 


FLVETEFCYVGQAGIiELLTSRDPPASAS 
KGAGMTGVSHQVQPQ* * S * LWT * /PSS V 
EAGTSFGLSFLSSSWALSAQEGCLAVPS 
/SGSRGLLVGAIiLLWTKPSPQLSPVPAS 
QRLSSLSLMPPLPQPQHLTHTSIET 


1180 


15081 


A 


1188 


160 


459 


NFMTIDMriCSAWIHFCSSGLDFQL*KL 
FKSQ* ENNLINKWAKDLNS FT I FSYG* K 
LAHEKMLiNIIREIYFTAIMS*YTLTRTV 
KVKKTDTKGW*GCTATGNLIHCCKNV* P 
LKNTAWQ\FHKILNIYLPYHPAIPLSDM 
NSRE*IQTI/CHAKMCAWIFI 


1181 


15082 


A 


1189 


232 


2 


KAPPPFFFFFFFFFFFFC* SGSHSVSWA 
GI * W / PGV* WHHHGSLQPQS PRLK* SS C 
LSIiSSSWDYRYVPPHLMLKK 


1182 


15083 


A 


1190 


2 


402 


PRVRHASGSPSPPPPPPGL\SHTSPSQ* 
VFSWPSY*TPCIiSALTIASVLSLL*QRS 
PRTLFITNKC/DF/PASHSSGRIPAGL* 
ALGRQGLFSCFFCFFETESHSFAQAEVQ 
WYNLSSAQPSSPEFK*FS/CLSIjPSS 


1183 


15084 


A 


1191 


19 


390 


WCVPAVPATWEAEPRRSR\RSKPLTGRQ 
SKSLSY/NKKKKKKKKKKKKKKKTGGGP 
* KKLLGGPKYKGGKKKKFFFFKGEKKKS 
LGGILKKKTFFWGGKKWPTPPKKNKALK 
GKKKFLGGRGGNP P 


1184 


15085 


A 


1192 


485 


112 


QRPDPR \ S AEAAI K YFLTQATAS 1 1 LL I 
AILFOTILSGQ*TITISITTNQYSSLIIIM 
AI AI KLGI AP FHF * VPE VTQGTPLTSGL 
IiLLT*QKLAPISIIYQISPSLNV 


1185 


15086 


A 


1193 


55 


385 


thafadawadawglfkg i ltenfpnlqk 
yiniqvqgg *rtssrfntkktnsrdl 1 1 
ilpkvkdklskkkkknXaenkkilkyrg 
gpf**kpggaqfygggrksfffffgg 


1186 


15087 


A 


1194 


408 


3 


CQSA\LLGGASQLG\SRGSGVRDPLEEA 

vcpfsdlqlhaerttallkavrqghlsl 
prlllsf\vclcpaprggacrgrqasls 
cgglhpvrasrllclrkrawamasvppp 

ASriP'PCSLXSDCCVSNQ*DSTGRV 


1187 


15088 


A 


1195 


5 


371 


LFSTNHGDFGPIjYLFP\A*AGVLGTDLS 
LLIRAEIiGQPGNLIiGNDHMYNGIVTAHA 
FVIIFFIVIPIIIGGFGN*LVPLIIGAP 
DVAIPRrSNISF*LIiPPSLLLLLACAIT 
EAGAGTG*TVY 


1188 


15089 


A 


1196 


76 


404 


PTPLRTHDQSSKVSRYKVNIQKVGAFLY 
MLSTRTTGI * \ IKSTSFPFASPKVKYIiT 
INIiTKCVQ/DLWKEKKL/NEIKEDQNK* 
/NVPFSWIKGPNIVN 


1189 


15090 


A 


1197 


50 


334 


ILHMVSIPSISIY*HLLPAG*AGTHIG* 
LPPA* FFWVMG/RDRVLLCHPGWNAWQ 
S*IiWASNSW\VK*SSHLGLSKY*D*RH 
EPSYPASGTKLN 


1190 


15091 


A 


1198 


250 


1 


QYYLVSSNT*SIIDFLQLPQKCIFTVGF 

SK**CICGFIFKNI*IjFF\FCLFESESC 
SVAQAGVQWQDLSS*QPLPPVFKQFS 


1191 


15092 


A 


1199 


2 


402 


TDARHHTWIil FVFLLEMGFHHVGRAGLG 
riLTSSDPPRPAKLNFL*R*GHAVM\CPG 
WPHE\TCLGLPKCWDYRCE/HTAPKPHH 
FLFFFFFSEKNFIPVPRRGGG/WKNLN* 
MEPPPPG*KGFFCPGPPKYWN*iCAP 


1192 


15093 


A 


1200 


50 


437 


STLYLEREKEQISLSVIVSSPLPSTSTG 
PTAAAQSSISGPSLPLPPHNGAGDAPAG 
LiG/YGQGPSGPPWGPS/SGPSPR*ALVC 
PTDPSGAARGGRGG\SRGSCCAPAGPAG 
LGDHRPGGMGEGPAAPPKSS 


1193 


15094 


A 


1201 


86 


313 


PPPPGGYEJHPSFXXXXXXXXXXXXKSLL 

P PGKG PNP PKVGLP P PXXXX VS P KXXXX 

XXXXX^NPPHSPPRAPGGPP 


1194 


15095 


A 


1202 


105 


485 


VQQTTMARIYVCNKPARSTPRFTKQILL 
DVLK/ YIDTQTI ILRDFWTP/LWSVMSW 
RQKTNKDNLDLNLMLGQLDLIDISRILH 
VSTR/ YI PPSSLHE I YSKSGHMLSHKAC 
HNNF* KIE I ILI ILLYHC 


1195 


15096 


A 


1203 


1 


476 


PHFGPPPPPQDPRF*G\GPSTRKGFFPQ 
PPGFFPQAWPRAGQPPPGGT/EPVFPPP 
KHP * RRNP YPKFGS PPLPGKIHQGSPGP 
SGQFR\PPRWGKKRAQPTFWAHPPPIFF 
FP*GVLLCHPDWGTVARSR\PPQPPPPG 


1196 


15097 


A 


1204 


1 


269 


WEFETYNIiGIK/WSKDMKRNFTKEDLM 
MADEHPRRC\*HHLTPPRTAQVKLELEP 
NE/WQGCGEAGPATPCWGDAAPVQPLLQ 
IGRRFIiKK 


1197 


15098 


A 


1205 


459 


42 


KLKPML I YHSENPRAFQDYAKSMIiS VLY 
K*KNEAGLRAHLFTAGFTENFKPAVETY 
CSE*EISLQI*LLI/GHPR/SLMEMYKE 
MNWFMPANTTS ILQSMDQGVALTYEYH 
YISNILYEAIDNTENPLIDLGKANAWVG 
PG 


1198 


15099 


A 


1206 


408 


1 


EMKTLTEMSSPGMPTEKVSELED/DIN/ 
EMHRKLQCREKR/ I KRN/E INVQEL* YH 
YKRCNHHVMRMPEKEQRKEE I FKVIMAE 
NFPNLATJ^KSKX'-KEAQRTPERIKriYI 
YIYIS*LQKAKEKESILiCETCQKTSKPK 


1199 


15100 


A 


1207 


7 


383 


LDIGCFWWSDKYSSCYFSLATSGKLKYF 
S FSCL IHCFLVCVCVRAHK\ CI* SRAS * 
CVCVHKVCI * SSAS * CVCVCARTNGI * / 
CQCFIj VCVCAQV/ C X * S S AS * CVCVCTS 
GI * /CQCFIiVCVCVCVC 


1200 


15101 


A 


1208 


405 


214 


ASAGRSLCPGGGGCNEL*SPPCPSAWrVT 
\SETLSQKQNPHHHKKKTGQVRGQSSYL 
PLWEVNR 


1201 


15102 


A 


1209 


384 


2 


G VTHH/ ARARFFC / LLDTGFHS VFQAAV 
Q*A\NHGSLQPQPPGLKPSSCLSLPSSR 
DNK 


1202 


15103 


A 


1210 


403 


2 


VPS IQTNGKSLTYFDFFFKQLFKTKLKL 
DASFVNVCVNVEPFRTFALSKTQSLLCL 
SLFLLI F *T INC3SrFCFLRQN/LRS VAQA 
GVQWRDVGLiLQPLPPEFKQFC 


1203 


15104 


A 


1211 


156 


405 


ESKQMIiIQCKSQMLYC*RI *TV/ CLCVC 
VCVCVCVCVCLTCGTKRVCI VFLP I WL 
NCK/PLCEPCC*CNRQGWAPWLI,PVIST 
RW 


1204 


15105 


A 


1212 


70 


408 


KAFSLIiPPGVSPGPIiCNPRRPNFIGGGP 
KKKPPPPPQRWTQGFFWPPPFFFFLWG 
G*GQD/PPPVAQG*MDPPPRPPK\GRGP 
RHEPPPPAF 


1205 


15106 


A 


1213 


410 


3 


KKSMPGFKASKDRMTLIiLEAQAAGDF\N 
LKPVFI \YHSENPRIIiKiSrYAKYT/PVIiy 
KWSNKAWMI ANLFI A* FIKYFKPTVETC 
YSDKKIPFKILLLIEWAPDHLRAIjMEMY 
KEIHAVFMPANTTS ILQPMDQGVILT 


1206 


15107 


A 


1214 


3 


340 


KIRVEVNKMYNRKliLEEIKKIKSLFFBN 
INKIDQFI.PRLRKTRQKMQINKIRNEQG 
GI SINIME I KYS YKEML* AI I CKKIiNNL 
HEMNKFL /EHRVPKLTQVE I ENLNYI YK 
K 


1207 


15108 


A 


1215 


1 


57 


RGLKIQNVlSrEIKS*CFEKIKIDKFLiARQ 
NFKRKRFKFRD\ *KGDIMADITEI*GI I 
RAYY*QL*GNyLENLEEP«GKFLVTYNLP 
KLNH/DIENLKKPVTT/REFKSVIKSLP 
LKKSPLHDSFTAQFyQTFEEE*VSVLKR 


1208 


15109 


A 


1216 


350 


1 


Q S S FFS I YPNFHLLS FLFCFKDFIjFFYA 
SVIATNSLPLFFPQISRYFPXPFERERE 
RERENWIi\CCL6WSAVMQS*LTPALNY 
W\ VRQSSLLSLQAS *S YRCGPPHPANIF 

HFIFCR 


1209 


15110 


A 


1217 


139 


358 


KNTFFVWEAKRLFVPQAGGQGGNLSFK 
F PAPP /NLRG* S AS S PS SGDYRPTPPCP 
ANFFFFKKNRISPWGSGW 


1210 


15111 


A 


1218 


361 


50 


WGDHGS L KP QP PLRS KMPP PLAS LVTGT 
Q*LG/HPVAPFPIQLIFFFFFLRRSL/D 
S IAQAGAQWCDLSSIjQAPPPRFTPLSCL 
SLKKKGNPVPVITWMNLEDIMLS 


1211 


15112 


A 


1219 


1 


361 


LKQHVSNVE KTAF Y WKTQS KTF VARE * K 
GE ILSSNRGECLLQS FKGQPLLLGANAA 
GD FKVK\ PMLLDHSEM/ PKALKNYTTS 


1212 


15113 


A 


1220 


292 


362 


HIPWPAIQQAEVGGSLEPRRSRP^^PPR 
LK*SSHLSLLNSWDYRNVLP/RIJ\NFCI 
FIiCPVR\FKLLGSNHQNASVSQSARITG 
VSHHAWPILKLKKLFVCLFEMGSHYVAR 
AGLK/PPGIjK*SSRLSIiQNCWESRREPP 
HPDTIH 


1213 


15114 


A 


1221 


372 


3 


RLFFLLPPRPKGDFFPTLLIW*RHGFSP 
PHVFKPPPLNLILGALKKKFFLPS /lpy 
VKFYFFKRAPLFFF/IFFFRDRVSLCWS 
* PPGLKOYTHLGLPKHWDYRHESACIjAN 

i^hnnnntnfffete 


1214 


15115 


A 


1222 


3 


291 


RSGDQDCPGQHGETLSLiLSLTFVlSrijSIjI 

cml*tlslifvkiqklawhggvri»*sql 
/lrrlrqenhlnqeg\dcipawatekds 

vskkkkkkkggpf 


1215 


15116 


A 


1223 


379 


19 


HMQILTIMRYYTPIRMAKMKKIGYTK/C 
W*GCAATGyi\WECKMVQSL/WQNTWAD 
S * KLNTHLS YDIAIQFL/GFYSKE I KAY 
IHTKPCT*MFTATLIIKSQI/MKQYKC!X. 
STDKWIKQNRGIYI 


1216 


15117 


A 


1224 


387 


40 


KKTLSTP \PEKHPTPPLFHPPPQKGGKA 
PPSPKPPPPKHPPPPKGISPPPP*IPPP 
PIYPPPPPAKPPPDFFL*SPPPPPPFSP 
PPPP*TPPPFFSPPLFPFFFFFFFLIiIR 
LLV 


1217 


15118 


A 


1225 


21 


477 


IVSRLITVRLQKPRLDPRVRPRVRRKEN 
YSSV/SFINIDRNVLNKIL/SSQIQQYI 
KKLIRHE*FWF/IPGIQGWFNILKSVIV 
TYVNK\QKWKKHTI ISVDTEKAFDKIHY 
LAIiILIKKKKKKKRALPQSFIPGDLFKK 
PNGGFPGLKMiyiGRAPGE 


1218 


15119 


A 


1226 


1 


398 


ERINHTLFFLAEAQKRLPTGISGRGRPA 
IPHNTPQRAPDHAYLPAAIiAAQHRAGGQ 
QAPPPG\SSPSSPYDEVKDREGDVTASH 
GLRGNGWGSP^ATSLVUSFNLMYMTAKYG 
DEVPGPEMENAWNAIiANNEKW 


1219 


15120 


A 


1227 


3 


238 


DAWVAGHDG/RTP*SQLLRKLRWEHRFS 
SGDRVSDP * SCIiCTLiAWVAE * DS SS KKK 
KGGPFKGTKFNSRGGGRNYFFYGA 


1220 


15121 


A 


1228 


207 


2 


NRVSPCCPVQWHDHSSLQ/ PRTPGLKGP 
\SASAF*VAGTTGVHHHAQLIEHFFFFy 
*DRVIQAGVQWRNL 


1221 


15122 


A 


1229 


15 


413 


RKSVNVIIHHSEILKKKNYMIISIDTEK 
SFDKX * YPFMTKLSEN*DSLNLiINNI Y/ 
AKP / VANT I LHS KRLNAFP I KQGCLQGC 
\PPFLFISriILEVLASTKGRKEIKSLQIR 
VEEI/KPLSLFADNM/IVYIENPKES 


1222 


15123 


A 


1230 


363 


1 


AADP FACRP VS PHPPL YLVTS SQSLIiT S 
VS PKKQPPS AGMQLGVGDSLS GWGWGRT 
KKTRIiFFFKTRSLSAAQTRRQWCEHRSQ 
P* PPGLIKRSSHLSI»PSS*NHRHMPPHIj 

AN\LCLFV 


1223 


15124 


A 


1231 


3 


402 


QANSCIFSRDFGSVGQAGLEPL.TPGDPP 
AS AS PRCRDYRC \GPHAQLLIiTFC* I S I 
LILV/CIS*MKDYFITCIYFFNYS*QII 
FYRRASDFFPFLRQGIiALSPMHDHGSLQ 
PHPPRLN\HPPTSAKGVTGT 


1224 


15125 


A 


1232 


3 


396 


FLSQHGFLFLFFAGIDKLILKFIWKDN* 
NNYEKED*RGVITLP/QYKAYSVATVIK 
MVWHWQSDKHVDQWNREPRNRPNYYMSQ 
* FPL * YVSQRNENLPLHKNPYMNVHNGF 
ICNSRKQSRYPSTGEWVNKL 


1225 


15126 


A 


1233 


3 


395 


LP KCWD YRR* P PHPAL F / L F F * KHP KFP 
KMQVK/ WRKH* TENRLVMYFSEKFGD * L 
KDQHKLYYLDMAFQFIFTPTYVMRYS IL 
FKPNIR*LTLSSVKMAVFLVETGF\T]yiL 
ARMVLrS*LRDLPTSASQNAGI 


1226 


15127 


A 


1234 


416 


2 


KIDKLILTFIWKCKRPLLAKIVLKKKNK 
I*RLPLPSFKTYGKCayrV/TK/TKWGWPK 
NRP I KGTGF * VQKKKTPHI YGQLVPDSG 
ARTITN/WY*DN* IST*KRMKLDLYLTI 
YKINSKWIKDLNVRAKTMTFLEENIWVI 
LD 


1227 


15128 


A 


1235 


384 


22 


FFQE I *NAIPVNTQMVRKQNSHPANIEK 
VL/WWrEGQTSWWIPLSQSIjTQSKALT 
liFNSMKA/E /RGDKAaEEKMETCRGWPM 
RPRERRHVHNIKV*GEAARSCGSIAAGS 
PLKHLLLHNTLIC 


1228 


15129 


A 


1236 


379 


2 


SPYTKIHSKCIiKGLNVRPIiTFKPLEENP 
G/VMVPDLGPGKKFIS *APKAWArKTKI 
IQWGYIGIiK/AFCPAKEAIPRVKTWPSE 
SEGI PAGHAFYEGF I FQ I FKGLQPFHSK 
KKKDLILKLDLLSADIS 


1229 


15130 


A 


1237 


1 


407 


YRVGQAGLKLLTL* SAHLGLPKCWDCGR 
EPPCLTIIAL*SVFSLVLPAVIjIKLINF 
F\CRDGGQTMLPRLVSNSWPQMXC/LPW 
PPKVLGLQ 


1230 


15131 


A 


1238 


3 


380 


ELSQRTCCMDIMS YYKAIVITMAWY* FK 
DKQRVQQNEVESPKTLLYIYRLWIYYEG 
DTTDKG*TF\WKWYWDSWI/ST/CKNM* 
FDYHFTQCIKIN/SQWM 


1231 


15132 


A 


1239 


1 


396 


FADDMI VYLENLKDSS KKLLELVNQFS K 
VSGYKSKV/NVHKSVALLYANSNQAE /Q 
IKNPTPFTIAAKKQK/ YKQKNNLGI Y* T 
KEVKDLYKENYKTSLKEI IDDANKWKYI 
PCSWMGRIDIVKMTILPKAI *RFS 


1232 


15133 


A 


1240 


392 


58 


SFSMLARLVSNSR/PSRVLPASASQSAE 
IIGVSHyAWPSKLSF*LTIDQTHLSCNL 
FI YVFPERRS CSVTQAGGQWYGHSSLQP 
*TPGIiKQSSCFGLPKCWDYRLEALPRLM 


1233 


15134 


A 


1241 


500 


204 


ShShS FS *DGVLIiCLGWS S TPGLKRFS C 

lslrsswdyrcvpssqtnfWflvemgf 
hhvgqaglelltssdnarlglpkcwdyr 
rvpprpaafflffkg 


1234 


15135 


A 


1242 


2 


397 


nfm innlpkkkalcpwftge fylp fke 
etip*virisiisl/lfsiifqkietegi 
lpnsfyegci iliskpykdi * \enyrlt 
svmhidarfldsilanpiqqcikvihcs 
hvaf v/ sgtqdwfniqksm 


1235 


15136 


A 


1243 


12 


362 


AGFYHVGQDGLKLLTSSDPPASASQSAG 
IKA*ATVPGLSPLNFCEVRFMESWSWKR 
P * RTS \ VCSAIoAKCWDRNASQWDKLPS 
FP T / RM* SP CLHPAVSAFCGS ASVLPGI 
IVTTSV 


1236 


15137 


A 


1244 


183 


383 


KQAGRGGSCL* /liPRCWDYRRELLHLAF 
MPG* LKK/FFCKDGGLTMWPRLVLNFWP 
PVILLPRPPKVLG 


1237 


15138 


A 


1245 


410 


2 


FSRDRVSSCWPGWS / PNS * PQSDQLAIiK 
CWDYRCEPLHPAFC*MNFLKVSC*HGTS 
/DSKY/CQHVSPKNKDISLCNYNIIIIP 
KKFNIL*YIWFLDFFFFETEFCSCCW*D 
LGSLQPPPPGFK*FSCPSLPSSWVHRHV 


1238 


15139 


A 


1246 


440 


46 


KTEGANINKNTTYQ/NLWDADKAVFRRK 
FRALNAYTRK* ERAQLNLS STLKTRKKE 
QNKLKADRR/QIiyiQI**KLGKVENKQTK 
TIQQIH*TKSWFFEKISKIYEIj*QA**R 
KKGEKTTITNVRNKRYSGRVG 


1239 


15140 


A 


1247 


266 


340 


RKCWPGAVAHTCNLXTLGGQGGRIT 


1240 


15141 


A 


1248 


3 


391 


DAWADAWGSRSRAVALFFFFWGFLGGGG 
riKTGPyFIPQVERRGIiNPD*WNPPPPGL 
RGSSPPT/LPKQWEPGGGPPAPSNFWFF 
FEKRGFPQVTQAGFKIiWN*GDPPAGPSK 
GVGITGGTPSPHPLFLKKR 


1241 


15142 


A 


1249 


3 


323 


]y!HHHA*IiVLVFFCGDKVSLCCPG*S*TP 
DLKRLSHHGLPKHWDYRC/RATTPGPFS 
LFFFPPFNPPEFLLPWAPPSYSLKQNKI 
FLKRFLEPLAPPKGLPLKRAKGRI 


1242 


15143 


A 


1250 


3 


405 


QPDSYSPQPGRLSAPPEQEGGPWLILPH 
ACAPSQICVGPHGAS ILSYQERKGTKVL 
SCEGHCKIiSSPVGLVGQSFCWQQPDAVQ 
WVPFRRRTQTPGTVAHACNPSTLGG*GR 
RITS /RPGVRDQPEQH 


1243 


15144 


A 


1251 


407 


2 


P PLKKTKKELP * GPAI PALGVPS FPP-FF 
FLPYKRKEIKGIRTPPMRIAALFTIAKI 
LNQPKYPSVDI *I,HKRHTSNTTLLSRMK 
H* WIKKT* /HTDMMEYYSAIEKNGILSF 
AATGMS PKDIMIiSE I CQAQKNRY 


1244 


15145 


A 


1252 


1 


253 


VADVRESLDPGD*6CSELRLCHCTPAWA 
T\SRTTSQQQLKKQWENTT*KTYGIQLM 
NCQIANYCSKLLYFRKMSNLQSNLKKKK 
K 


1245 


15146 


A 


1253 


3 


372 


KKKAQKSGTIVLPClSrPSYSGG*VGENA\ 
WARSQPGLI S * KQQ * QQQNQQQTTSKTD 

pyiyg/ylihdrggkXntqgwdrlunrw 

GWGIWLMTQRKIKLELYLTPYTRTNSTW 
I KDLNVAGCGGS CL 


1246 


15147 


A 


1254 


2 


410 


KNIjTPIVGI.KAATN*ESIPPQT/SRKKK 
KKKKKKKKRGGALKKKKPKGGGGGKKNF 
FKGGKKKKLGGGVKKRGEGKKPGGKKKK
RFGKKS FFSRGGKKKKNRSS SS S S SSS S 

SSSS 


1247 


15148 


A 


1255 


61 


487 


GRPGPTHAFWIDSFFMGNEARFIWHS C 
DPNCEMQKWSVNGVYRIGLYALKDMPAG 
TELTYDYHFHSFNVEKQQLCKCGFSKCR 
GIIGGKSQRVNGLTSSKNSQPMATHKKS 
GRSY/RE*KKNKKKRGGRFKGSQFSSPG 
MQG 


1248 


15149 


A 


1256 


118 


5 


MFIAELFI lA/RRWK* PKCPLTDEWINK 
M*-YSHTM*HY 


1249 


15150 


A 


1257 


390 


1 


TQKNTPTPKPKPTSQQHHQKNKPTPPGF 
FFFSPPGK\GGFFPSPLFWVPPGFFPPP 
VFKTRPPEFIFGAP*KKFFFSPPRSLNF 
FFLiRGPPSFFFFFFFFS*VGKEGSSPSH 

ENPLFVPTEGW 


1250 


15151 


A 


1258 


378 


3 


GAFFFFFPPPKKGFFFTPFFFFGPGFFF 
SPPFFIPPPQIFFFGPKKKKKFLPPPPQ 
KIFFFLRPPPLFFFFFFFFFFFFFFFFF 
FFFL\DLD*NVLSGTSAPEKNQELLGMV 
AYACNP S TLGGRGG 


1251 


15152 


A 


1259 


462 


287 


RDG\FHHVGQDGLHLLTS*STRLiSLPEC 
WDYRCDLPCPAIPAVTLYQIQYRPLGLE 

SKA 


1252 


15153 


A 


1260 


446 


34 


NVRAET/ IKTLEVNTGVNIjHDLS* /GKA 
FLDARPKAQVAKEKSRQPSLHRRLR/FC 
ASKDPIKEVKRQPTE/WEKIWHVSDKGL 
EFKIHKELLEKLSmiPI *KWDKDLNGHF 
SK/ELQIASKHMRRCSASLVIRKCNEIV 
DPD 


1253 


15154 


A 


1261 


2 


383 


GSQRKWFLEMESTPGDNAID I VE VTTKG 
FNYYmLVDKAVAGFERIDSNFERSSSV 
SKILSNSIACYR*IFCKKKSQLMQ/QTS 
LWFYFKKLPQPPQPSAATTL I SQQLYTL 
RQDSPPAKRL* FTDGL 


1254 


15155 


A 


1262 


476 


50 


FFFPPSDTGSHSW*AAVQWHDHDSLQP 
* PPGP\SDPPTKSSTRBF 


1255 


15156 


A 


1263 


143 


472 


TGAVPIRPSWN/RPPAMIFF*NAQGILF 
AEFLASQRAILWEFFEKAYHESVLRKSA 
KGLAEKCPGKLHQRVLVQYDNAtAHFSH 
QTRTTKSSTRQFR 


1256 


15157 


A 


1264 


179 


489 


YIFFFLSLFFFLWPRPEYRGAITVHCSL 
NLPGSSDPTASASSVAGTKPH* YI FFFL 
SLF/CFSVAQA*VQGGNHSSLQPQPPGL 

kXyptasassvagtkphpliffff^kkk 
FL\CFVPQAGGQGHDFG*L*PPPPSLKQ 
GGGLTLSRIGDYRGPP 


1257 


15158 


A 


1265 


1 


536 


FRGGWGSVRRAPGTASCYIiAIiAGGPPGQ 
CPP AAI S CPS PIiPWGS * TE / P YVPRRPG 
DPSAAP / PS W * VPRPGVPRNVPGRGRVG 
QDGCGQAQDIQAAASLMADTAPDSPGSA 
GSVRALPACVPEISGS/SGLPPGAALP*
VAA/RPTPGRHVDTHPRQHS FCGGQEGD 
IRAFPTLYLEVYPGPP 


1258 


15159 


A 


1266 


403 


2 


XSLHPRGYMRLLRQGFISAPCGYMRLPR 
GSSIiHPRGYMRIiIjKTGVHRCTPWVHAPS 
QTGVHLCTPWEHTPS*RFISAPLGAASH 
VT/GSLSIQHIYIPVFCLLRQGL/NSVT 
QAGVQWCNLGSLQPKLPQAQVILTK 


1259 


15160 


A 


1267 


2 


401 


FVLNPGGRSCSEPRSIjHCTPAWATRAYL 

qlgkkkkekkkergeeaslknqgpprplkr 
evgtpgqkkpl/wggsngagqkqpa/qk 
kggkkkadhkgqr6kn*krkregggsts 

KNNSRGTGAKA*NPTIWGGGGKKI 


1260 


15161 


A 


1268 


49 


416 


lrgraldprllrecgdlgappapevalr 
agtcwtr/cti»*apprga/drspwpprs 
pmckag/decqdgipg\mkawscglrtr 

QCPWP*PKIjPCGPGHPAWRT* plpqta/ 
CGPGPAAPCAG 


1261 


15162 


A 


1269 


420 


47 


GPPGWASFRLiNFPKARR*GEWKTPGESG 
GAFFS PPGKNF\AGQN*GRPPNPPPPGP 
GKGGIQTRGGAGLGKNPFRFWGGFPNPG 
NK*GGGTKKKEGPPPPLFFPPLNTDFCN 
LIKRDRLGVGAHL 


1262 


15163 


A 


1270 


404 


1 


AEILELKNVIDILKNVSESLNSRIDQAE 
ERISELEDRLFENTQSEESK*KRr*KISrE 
ARQQDLENSFKKANLRVIGLKEEVSREM 
GVESLFKWIIIENSPNLEKHIHIVQEGY 
RAP\FNSNKKTSRHSIINSHTK 


1263 


15164 


A 


1271 


387 


2 


KKEKKKEVI 1 1 FVLLLPLK* FQNIQVWL 
VRELEKKFTGKYW\ FA*RKI LPKPTQK 

rctknkqkrprihaptaahvaiiiedsvf 
pgeivgrrirvkwtqqltrvhsdkaqqn 

NVERKVQTFSGIDKKLR 


1264 


15165 


A 


1272 


1 


393 


PRMGRLPGGGAPHFSDGVAGQRRSSPHR 
RSRGRAEALIiTSQTGRLGRGAPHISDDG 
RPGRDAP\PS*TGWQPGRGAPHFPD\GQ 

pgrgaphipddgrpgrdaphfpygvaag 
qrlqsrhfgrprqaagrwrl 


1265 


15166 


A 


1273 


3 


317 


S S YETKG VMIASFSSREADNHTAPIRI K 

tnasdstefi ilpvevevttg*wkrde i 

escfvfsiaysgkvknnc*lfffp\apg 
lysstemldfgtlrtqgkkif 


1266 


15167 


A 


1274 


92 


368 


lcppggragqgqnln*gvpsppgpkgfw 
gptygggggkrttrpgggkpeks pd/gp 
pgpppggtkgnpapkmffaqkergpffq 
nggpgnklf 


1267 


15168 


A 


1275 


401 


3 


DMCDWFKKEFSDTTPKA*SIKEKQLTSF 
IKIKSSCSPS\KNTIKGLTRQEKIiWGKl 
FANHMSDKGLVSRIYEELSKLN\K*KIjN 
P* KWAKQ*DHLIiKKTH* *QICKERYSLG 
KGKITMRYHSTSIIMCIIKTIHS 


1268 


15169 


A 


1276 


298 


426 


GATMLVLLP* LAFDLRQSL/NCFPQAGV 
QWRDLGSLQPSPPWFK 


1269 


15170 


A 


1277 


421 


1 


VSFKDQKL* INKENHPIKKWKKICNQPL 
PQ IRYMDANKHIKIFSTSIiA2UlETNMKI 
TT/RILWKTWQFFKKVSIYLPLEPAIS 
SYLFTQEK*KHIDGSCSPSRVATAKLLA 

LQWRRPSWNCMHQRAGSWKQIGAPPFSK 
L 


1270 


15171 


A 


1278 


1


218 


TRSGVQDQPNQHGETLSLLKIQKLASYG 
GVCV* SQLIiRELRQKNCLXNGPRSHHCT 
PAWAT\EQNS I * KECKKG 


1271 


15172 


A 


1279 


88 


396 


TFLFSSSFFFFGGGVPPCPPGLRPG\PN 
FASLHPPPPGFGGFPSLTSPRA*NWGPR 
PPPQATLGAPGLEGB*IFPPGRPPBP*L 
WQPPPPPPQRGGMAWPTPL 


1272 


15173 


A 


1280 


94 


399 


TDFLFIi*TDFLFLCLCSLKNKIWVNEFR 
YGGFSLGVSNTQALPPSQEVNDAIKQMK 
KHLKIiAK/DKISIVRCITKMGM*LLGYR 
SSWQVKRITWQLIVIiWMLAYRAVFGI 


1273 


15174 


A 


1281 


236 


2 


TQSRLVFFYMTGPAVYLNHHALRTHQGS 
HLCFFFCFEMESHSVTQAGW\VWWRDLS 
SSQPPPPRLKRFSCLSFPSS*DY 


1274 


15175 


A 


1282 


37 


410 


KKKTLKKIKKLCPPGVKGEFNPPFYVIiS 
KVFPKKHRGLFFKVTFVSLTPFFKKKNF 
KlGGGSGFPPPIiWFFFFFY/RDRVLLCH 
PGWNAVTRS*LTTSVDSSDPS/CLRLPS 
SLDY 


1275 


15176 


A 


1283 


404 


232 


LSSWDYRCLPPHPANPLYF* *RRGFTVL 
ARMV/GPRDSPASSSQSAGITGMSHCAQ 
PG 


1276 


15177 


A 


1284 


3 


402 


MQI KI TMRNYCTP VRMP * KKK\ SRTKPR 

g*ggggttk:ilipcwgdykige/plw/k 
s vwqpl r k*nnhll ydpailpl i fytre 
mktyvhtitkygnhnek/cmqgftealf 

/TLPKTE/SQPKHSST 


1277 


15178 


A 


1285 


1 


253 


LRGKFMTLHS / S ILKRG*SQINN*TATL 
IK*KKKGKIKPK/RSRRKVI IMNRNK*D 
*KQimiSrKFNETRSWFPKNMKKIYKSl4VR 
VT 


1278 


15179 


A 


1286 


400 


105 


FNFKKKKKKKNREVSE*QY*QAKYLKRN 
FTKN I * VAEKHMKNAQ YVI KEMIi I * LTM 
RYY YTP I K\MAKI KLTDNHKCQQLKP S Y 
MVGKEFGKFLIKLNM 


1279 


15180 


A 


1287 


400 


119 


TVPHHIGQAGLELIiTSSDPPVSASQSAG 
ITGLSHW\ PASSS PHSNHS CPVPLBPHQ 
VAQLDSFGYEKVCSFSN*VPGAQDKNDL 
SEN 


1280 


15181 


A 


1288 


157 


386 


NDLQFHPFYCLF/DLFCQSLKKAGMQWH 
DliGSLQPIiLPRFRRIiLCLSLPSSWDCGH 
VS/PMPG*FCIYTHIYFLVEMGF 


1281 


15182 


A 


1289 


400 


124 


FDAPAKGQGGDFGSPPPPPPG/AQKIFP 
PHFSQ* rG*KEGAPMGQPlFVSFSKTGA 
PPPGQGGFQIPTPCGGPPQNFGIQGGAP 
APGPGGGFF 


1282 


15183 


A 


1290 


419 


3 


KNFFFLERGFFFFPPGGRAGGGF* FPPP 
FFFQGKKISGPPPPKKRGPPGAPQKL/R 
EIFLVLKKKGGPPLWPGGFLNPAPKNFA 
RANFPKRGGSRVGPPSGGPNLFNIiGGGW 
FFRFFFFFFFFFLVFETEFRFCCPGWSA 
M 


1283


15184 


A 


1291 


397 


39 


WGQVWAKDWIHGLGPKTQGCPPPLGVFG 
CP /SL I PPNLPF YGI PGGLKI FVHKNPP 
FPIKKEPANWIATFFPPGVPLFFL/CFF 
ETRSHS VTQAGVQ* RDIiVSL * PTSGFKR 
FSCLSLPSSR 


1284 


15185 


A 


1292 


399 


1 


LNFFFLHRQKLCN-irjGSDDiCVPAFS I YP 
EPRICYVFSLLFEIIFSFDPNKKIRIYK 
KHVSRIYKDLLKFNNKDNPVKKWAKDLN 
RHFSKDGIQKVKWHIRNC*TIiAIK/EMQ 
NTTTiyiR*HLMPNRMAKIRKTI 


1285 


15186 


A 


1293 


412 


1 


ARMVSISRPLGTPIWRSQKGGNKRVSPP 
TRPNPL.FLKNPKKCPKSRD*NKPPLLKK 
KKKWSGPCPAQ/SCPGQWPHP/SPPKQL 
PPSPSTPPSRPSPTLAFPMAS^LAISPA 
HTHCCWRELSGTHTLS I P CLKPLRGP 


1286 


15187 


A 


1294 


229 


1 


FFFETESHSVTQAGVQWCNPGFKRFSCF 
GLS S S WD YRYAP PRP \ AMF \ * FLiVETGF 
YYVAQAGLKLLS PGDLPALAS 


1287 


15188 


A 


1295 


401 


2 


STLPVH*KWTTKAWMTAHIi/ FTAWFTEY 
FKPTVQNYCSEEKIPFKILVLIDNAPGH 
PQALMGMNKEMS WFI PV/NTTS ILQP \ 
ADQGVIFTLKSYCIRNLFQHQCTPAWT 
QLDSVSKK* INI FQPGMVAHACN 


1288 


15189 


A 


1296 


1 


336 


KKTPRRKTHKEHHNGEKX LRAFPLRSG I 
KQ/GCAF*P/YILNIWA/VLAKEYGKE 
I KAS FRKEELKIiCL FIDMI I F VE I PENS 
PQKL\ IN* FSKVAG* * VSTQNSVAFLFF 
PFFF 


1289 


15190 


A 


1297 


3 


384 


HTNMQKVIiV/VKTEDQPSHNIPLNQILI 
QNKSIiTLFNS IKAERSEEAA* EKCEDSR 
GWFMRFKKKK/RISITKMQGEAASAWQ 
AGTSYPKDLAK/DEGSYTKQQICWVNET 
VLY* KMX /PTEDFWRKDKSM 


1290 


15191 


A 


1298 


3 


395 


SRSVTRAGVQWHDLGSLQSPPPGFKQFC 
LSLLRSWDYRLSPPHQQMGGIFL/VFLF 
PFFPGKGVFLGGPQAGGKGPNL6*WNPP 
PRGLGEFFGLNPPRGWGFQ/HPPPHPQL 
F/CCFFRGKGGSPRRAGRAPNP 


1291 


15192 


A 


1299 


260 


4 


GFIVSSAVFPLLKCLLDILVSSSVLLYC 
LWNFKIQKSFRFYNLSPLVSSFKNSH*N 
SNI /WPGAVAHACNPSTLGGRGGRITRS 
GD 


1292 


15193 


A 


1300 


91 


219 


LSRLSAWDY*HVPTHVFFVEMGFHRWQ 
ASLELLS*GDPPAIiA 


1293 


15194 


A 


1301 


277 


441 


TVEF*GVQSMKEEI \ELCLFEDDMSVYE 
ENLKELTKN\LLK* I SNYSKVAGYKVNI 
*KAIAVLHTSNEQ*NFEIGNTILF/TIS 
TPKYLGINLTKYAQGLYEENYSNLMNEI 
KELNTW/RDILCS 


1294 


15195 


A 


1302 


256 


482 


YNVYFKICIGPGTVAHACNPSTL6\G*G 
GRI/TLRSGVRDQQVQHG 


1295 


15196 


A 


1303 


182 


460 


VGREFLDMTGKEFIYKFLYIRKIN^SSS 
KLKTF VLPKTLLRD* KAKLQTGRKYL* I 
TYPVKGLVSRLCKKLSKLNSKKTTQIjKK 
WAKDMNRHF 


1296 


15197 


A 


1304 


1 


468 


FKQFSSLSDLSSWDYRHPTGNFCELAPY 
KQNAPS CSNVFTD3WP VI ST * QGLQALS 
PNWEQSPWRETKKTAVDLCGVIWKPCQ 
VFLRKKPKTKTGQ /HEQQI FGF / CFFSE 
TESHSVTQAGVQ*HHLGSL* PLPPPFKR 
NSTTRTSDYDGP/HVPP/HPA 


1297 


15198 


A 


1305 


880 


1026 


EEHGAGLSGSQDAAGGVPAG*GGWAQLW 
VTRRASLFLDKTHWPVDEQNLGSLYTIE 
ATAYGIiMQKLELGRYNETHAIAKWtXEK 
QELGGGFRSTQ/SDGDPRETTWAIiEAL 
TRFREAVPFKGIQDLHVQIRAPKTALNV 
NWYIDHSNAYQQRSAKFLAQDDLEIKAS 
GNGRGTISIIiTMYHKSPESREDNCNLYH 
LNATLHSALEENKKGGETFRLRMETRFQ 
NN*EATMTIMEVSLIjTGFYPNQDDIiKQL 
TSDVERYAFQYKTKTSTSDSTWLYLEK 
LSHEKNTELGFRVHRMLQAEFLQAALVT 
lYDYYEPSRRCSTFYNLPTEQSSL 


1298 


15199 


A 


1306 


3 


726 


RTDHYQFQSLKHCLTGGEALNPDVREKW 
KRQTGVEIiYEGYGQSETWTYKWELSYE 
DANTY I VK* KTIi* TQKBEG \ 1 1 CANPKG 
MKIKSVSMVKESLPYYVHIVDDEGNVLP 
PVEERNVTVRIKPTQLLCILNCYLDKCE 
KTAVS * QGDFY ITGDRARMDKDGYFWFM 
GRNDDVINSSSYRIGPVEEESALVEHTA 
VDESTWSSPYPIMGEVGKAYIVLTRAY 
SSHDT*ALTRVL.QEHVIK 


1299 


15200 


A 


1307 


230 


486 


NAICPWGSKGSLGIiGSCSVTQTGVRW*D 
HSSLQPRPPGLK\YPPTSAS*VA\GPQA 
GVQWHGLDSLQTPPP6/FKRFSCLSLPS 
SWDY 


1300 


15201 


A 


1308 


463 


440 


SAIPVHT*RRRKQTSLNADMEKV*WWI 
EDQISHNIPLiNKSLIQGKALTLFNGMKA 
KRSEEAGEGKFEASRGQVMKFKERSHLY 
WKVQGEAAGADGEAAASYPBDLAQSTD 
EGGYTKQQFFSIDV/TTFYWKKMSSRIF 
IAEFHHTD*GDNGSVP 


1301 


15202 


A 


1309 


463 


59 


EVISTLTKMGKILKTKNAK/CWQDCGTK 
RTFIHCWWGYKLVQPLWKRV\YTFPYNL 
AI PL IG I YP /NMKI YSHKRTCTKMF/ I 
NSLFIISKNWKHLRYTFTREWIK/M*YS 
HTKDYYSAI KRNKLDMFSNMHE SQ KHYG 


1302 


15203 


A 


1310 


438 


29 


PGGKGGGGPTPAPFPFPPPPPKVGVFW* 
GFPRPFLFKPPPPNKAPFPGAPHFF/SL 
PPFPRVFKECTFPTKKKFFPPGGGGKKKP 
PPPLQGGKKMGSALKFPSGGKKPWGISP 
QI 


1303 


15204 


A 


1311 


2 


310 


hnqknefgpyvts yvkvkeak* laktrk 
lleenigvnihdielgsgflammp\ea* 

AI RLK I E KLD F I KI KNS CASKDT UsTKVK 

ktgrpgmvahaynpstlgg 


1304 


15205 


A 


1312 


19 


348 


rgqqdvlpg*rgqqdvlpgrgtyseygl 
ifsarnpsmev*svwhrpfhhhgvp/hq 
ncsdsgs ypiskecgn/ wsacgih* cc 

LVPCLPKAADLGE * WNGDFSLSGKEKKK 
EFYLIMTFSQ 


1305 


15206 


A 


1313 


2 


424 


WKhTNHGKTGNGGAQSQ/ PPECEPRTH 
LQRGLBGRGGGEPWGQGQPD\PPSWSSL 
KNPQAGVPPFSSLKGKVKRDRSVSVDSG 
EQREAGTQ*TFVNGKKKKKKKKKKAKKA 
AYSPDSKA/PSSKSSPKSS 


1306 


15207 


A 


1314 


375 


3 


KKRTLLIPL!yrNMDAI/MNKILAN*IQQH 
IKRI IHMDQVQFVPGMQGWFJSriQKNINV 
I Py Y* LFY* CYINRI KDQTLI 1 1 SIDTE 
KNDKIQNSFIVKTLRK\IKENPLTLIKR 
IY*KSSNNKKNID 


1307 


15208 


A 


1315 


403 


3 


LEGVSFFFPRVASQGSILGSCNPPLPRF 
HHFS/CPHLLSKWGYRFPPSPPA\FFFF 
FIjKTGFFFFCKNFALCSFP*KQASPPPP 
LTVFFFS/HHPPPFPPFPLFFFLRWSFA 
LVAQAGVQWHDLGSLQPPSLGPKRF 


1308 


15209 


A 


1316 


38 


427 


PEXPARPTRPRPSAWQPPRLRSEEPLRP 

aaapsek*gasppgshpiwevrsvsarq 
pprpggrwwgvnpppgqpprpggegrlc 
paaptgk*gaplpgqpprpggrwggypp 

arsar.sygr*gaplssrp 


1309 


15210 


A 


1317 


43 


399 


LTFFFFLGEGARPPPPGWGPGAHQGITA 
PLFWGGQGNPPP*PPGEGEPS/QGPPPP 
GGNVFFWKKKGSPRAPGGPWTGGPKGIiP 
RPPPPKGCE*RGNPPPPTPKFLGFTTFQ 

KKGKNSGP 


1310 


15211 


A 


1318 


437 


54 


MNEQKGERGLS SLGIiPAR * D * GKGNVKE 
GR/EGVTFSPREPKEESLRWSTPQKEIV 
GVINHPRGQRVDRQGAGSGCEGFALRAQ 
TGGP PA* E ITRAGEQKAGGGS KGAQTLQ 
RETRPPRGQRGGGRSASL 


1311 


15212 


A 


1319 


351 


133 


GGGWAEVPLTSQVGRSGRGTPQIiPDGAA 
RQRRPPPPRRGSWAEAPTSQTGRPGRGA 

PHLPDDGRPGRDAPHLP 


1312 


15213 


A 


1320 


474 


11 


KINSPSQKKKKKKQCSSIRKLA* /DQNR 
HFIEEETQIPNKHMKRSSIS/LAIKEMQ 
IQITKSYCIIIRLCK^IilKNSDSIKCWQ 
GCREI /GStalHCWRKLKMNSTALQMVYK 


1313 


15214 


A 


1321 


485 


33 


cpasrvag i tgahhhawl i fvflvemgf 
hhvgqagle/lpvswgngghl/rlcllf 

GYCGQCCYKPS*RRFFCVSFYIRA*K*G 
FVCLFV*MVSCCVTRLEFWTHRCNHSK 
LQP* IPGLKCS\PASGF*VARTTGL.YHG 
VWHVSNSTAIiTSGLQ 


1314 


15215 


A 


1322 


451 


145 


THPFGRPRGGVPKVRKLxNPPWLKKENPL 
FIKKKKKKPGPGGGPFIPTPSEG* /GKK 
KGPNPEKEASNKPKFAPSNPQTGLGKKK 
KPPSLKKKKKKKKVYSLIF 


1315 


15216 


A 


1323 


470 


151 


GKAEERHDPGSRACSEPRSCHWTPAWAT 
\SBTPFKKKKKKLCISWPGTVESCFSVR 
T * * I * LRQYVLYLFLFQPKNVI YLFICL 
HDKMSIMVMFLDFKYYFCFLFLD 


1316 


15217 


A 


1324 


64 


367 


TWEGEGSWLTSQDRTTAPLHPSLDNRVK 
LRLKKKKSVGFLSPS ILIiAKNQIKKPAP 
FTMA/SK/RIKYLGINLTKDVKDLYNEN 
Y* TLMKNLKGGAMCSFTSL 


1317 


15218 


A 


1325 


193 


381 


AQLFKTSMGNSETLSLQKKKKEKS*LTL 
PSSWDYRHPPPRPVNF\*FLGEM6FRRV 

AQAGLEL 


1318 


15219 


A 


1326 


410 


7 


GFFPYP*LPK/SLRC*GKDFYNQGSGQA 
RWLPP VI PPPLEAQAGGS P *GRS LRPTW 
PAWSNP S FLKKPTPLFLiKFS * GRRI ALT 
PKAKVSVNWDSPPALQPGGPSKTFFPKK 
KKKLRI IGPGWAHACNPNTLGD 


1319 


15220 


A 


1327 


295 


3 


VCHFGIYFCVCSLYFTCLYFPFLIFL*V 
TLTFLVF-SfFDFTVLFIISIj^YFNDCSRD 
CNIHM*LLSLPVLIFHCCC\FVFRDRVS 
liCHPGWSALASSLL 


1320 


15221 


A 


1328 


2 


392 


RQASRPP * S APPAPAGKBGSGE * PPS PK 
/PPPPPPKWPPRGFPFPPAGPPPPPPP 
KKSPPPKTNPPPPPP/IKPPPPPPPPPP 
PPFFFSPPFP 


1321 


15222 


A 


1329 


378 


7 


TDDLIIiKFI*ERyKPNQP*KRTKLKDSL 
PDFENYYKAKVVKTRW**YKE\KHIDQW 
NRI ESR* QI FNIAS 1 1 /HIEKE YLCaSTKW 
CYRTT3CMEEKNMNLKPI,P*TIKFNSK* I 
TDLNIKTITIWLKL 


1322 


15223 


A 


1330 


403 


2 


VETGFRHVGQAGLELiIjTSGDLPTrASQC 
WDYKHEPLRLGELRSLTAAWAMQQDPVS 
PNK* IKrK*l/P*SEQFSGI*S IHNWQ 
NRHFYPDPTHFHHSKVKPLTHYWSPHS 
FLSPTPGNHQCLLSVSMDFS IIj 


1323 


15224 


A 


1331 


1 


389 


KMKSQAIDLEKIFTKHISDKGLGYGLYK 
ELL*FN/R/RCQTTQIjKMGER*TNSSTM 
GMS FEWMVSKHMKRCSCAFPSLVTREMQ 
VH/TTSLFVERYHYTPTRVAVIKQSDQV 
\W*ECGIRTLIHC*WECKMIAT 


1324 


15225 


A 


1332 


392 


3 


NNFMPSSA/PPPPPPFFFGGPRVPSPPP 
FFKPPPPFFFFGPQKKKIFSPPPPLKFF 
FFLRPPPPFFFFFFFFFFFQKNLGKGFF 
F*PRPEKKKKPPPGGVFFFSLFFIjRCSV 
ALSPRLECSGAISAHCNLCI 


1325 


15226 


A 


1333 


394 


1 


SQWGPTAIHYCAWL.VFKFLGHFSKTTI 
S APLLK* TSTRAETS KS FS \ CPRKMERA 
EGRRMFLGKSLLKQIVLLF/lilSGSCVT 
AS*FL/CFFQTESHSVAQARVQ*HDLGS 
LQPLSSRLKRFSCFSLPSRTRG 


1326 


15227 


A 


1334 


416 


1 


LTLLPKPTPYNTRKENYRPISIilNIDAK 
in/NILAGKIPQYIKVHN\NMGLTPEMQ 
GLFNI * K/RKSVNVI CHINRKEEKKILL 
INA* IVFDKNPTMI *KNSWQ\GEYKEI Y 
SNILLNGKMLKACHL\KTRIN*GCQLSP 
DAW 


1327 


15228 


A 


1335 


391 


57 


WQRVCWWDKNLYQKEKRQDNVREK/WAT 
DLI INPTHEEMQSVNNLQKDTHPHSLVI 
REMQIKTTSY/HLXiAKILNSDNSSYW*G 
CGGMGTL* CCWWVSKEIHALWEIVWFl Y 
SK 


1328 


15229 


A 


1336 


285 


2 


I FFFPWRGE I I * HLSLINKGSRQNERAH 
\KDNGDFSQI ISKLKLMYKYKILNQTTI 
KYMFFPKKHRTSIKIEHILGSKGSLNNC 
QRISVIiQTGRV 


1329 


15230 


A 


1337 


34 


391 


AKIASLYSSIiGNRANSVSKQNKKTKNQN 
LKN/IKTAGGLGWAHSCSPSTLGGLH\ 
DPGV*GCREL*WCRCTPAWASG\ETLSQ 
K* Il^GKERKKKMKNCWWMNNSXiIQLiTS 
FSFTKTDWSS 


1330 


15231 


A 


1338 


292 


1 


GGGISKTPGGEGAINPKKPPALPPGGKK 
EAFSPPQKKKKGERA*RDISSEDI *MAN 
KHMKS CP PLM I REMQ IKSTMRYRFIP I 
RME\IIiKKQKNS 


1331 


15232 


A 


1339 


369 


16 


DPWQNEQLSRVTCPGPRRASMFRIT^YK 

VI iaalmaysvgpravsciralwttygX 

IMDNYjECRHERRFLKPL*WPGMVAHACNP 
STLGGHGRHERVGRVRSIPDCWRVAAG 
VKASI 


1332 


15233 


A 


1340 


394 


1 


TQSGVQRHDAGSLQPQPPRLKQS PHPS L 
PS^QDHRRVPPCPARPGSSHDSYVLREG 
PCGRW/HESRGRVFLVLPS* **LSLiMRS 
GGFKKGEFPCTSSL/SCLPPST*DVTCS 
SIjPSAMNVGPLQPHI FMHGF 


1333 


15234 


A 


1341 


400 


161 


KIGPNLPCPSTKRPWKIDKIQFWPPGKV 
F*/SPPPPKKILFFPPPPSGPFF\PPI,P 
PLPKSPPPPKNPNLPPSFGFDIPPP 


1334 


15235 


A 


1342 


377 


3 


NRHFPKAETLKASKEILKRHLPLLVIREV 
Q I KTTNI CHN I ATRIiAKM * KADNTKCWE 
S CEP VLTFY\ C * WE YKLVQPL * KMVWQK 
PKVCVPYYLiAILIiIiQHIPEGTCAS/CY* 
DIHKRLLTAMLPAM 


1335 


15236 


A 


1343 


170 


369 


RCNENKITKEVYFGEIFVRLDNKEKNTS 
FFYFYPLFY/CWR*GLALLPRL.VIiWS*A 
QWLItPWPPKML 


1336 


15237 


A 


1344 


385 


2 


SDLRRSTHLGLPKCWDYRC/RATVPGRL 
FIFLMVSLKHKSFKF**RPIYIiFFIjSSIi 
MLLVS YLRNH* L I QGHKDLL PMFS S KS F 
MILVLTSRS ILSLLFCFLFFVLRQGL/N 
S VTQAGVQRHNHGS LQ PQP 


1337 


15238 


A 


1345 


1 


281 


HMATKHMKRC*/SEMCTLKPQLNTTTPD 
HQNG*N*K*LVNFKGWQ*FGAVWNLIHC 
W*DCKIiAQPFWKIDXiSIKAKISMSYSHV 
VRKIFKLKNK 


1338 


15239 


A 


1346 


176 


1 


S VI WNI KE P * lAKI ILKKKT I VGGIiTVP 
DFKT YYKAMVI /KT V* S WTiKDRQRITQWY 
RED 


1339 


15240 


A 


1347 


398 


2 


PRPPGPVRRRCPS/LTATSGSSSPPSSP 
F YLG YPQGFPLPAVLNRGPG ILF WGPHK 
KNITLPARGR*IGSS*TAPPFFF/SIjFS 
FLLFSFLIiFDTGSYSVPQAAVQWFMHGS 
LKPQPPGLKLSSQLLRRLGW 


1340 


15241 


A 


1348 


9 


395 


GLQNPCVGFLVSGFFFFFFFPPKKGLGV 
PPKKK/RGPNPNGPVGEFLATGPFFWTG 
PIiKKNPAPPRALFWGPPPPPPWGG*NPP 
PFIiARGSPFFKKFFGGPDPFFYRTWPRG 
PNKRGPPWPLKWGNPPPK 


1341 


15242 


A 


1349 


119 


1 


ENVFRNM/WPGMMAHTCNTSTLGGQ6EW 
ITRSGV*DQPGQ 


1342 


15243 


A 


1350 


58 


396 


GIRVGKVCFIIFFFGFFFFFGKGVSIiPP 
GRETGGPPL\LIKPPPFGVKGIFIiPPPP 
GGGG\CGPPPPPRVIFVF*GKGFPP*GP 
GGF*PPAPKGSAPPPPPLWKNWGNRIFG 
GR 


1343 


15244 


A 


1351 


396 


1 


GPTKGPGPFLDGQGFPPPTIiKQNFFPPA 
LFFPPLGKRIiILGGF\PKPGGIiLQPPr* 
KPLRFKPG6ESDCPPKFF*GRIVCFQQS 
liCFFPPPKKKKTAFGKRFKKKTIiIiFFFF 
SDGVSLCHPGWSTVAQCRL 


1344 


15245 


A 


1352 


1 


250 


RRI.*SQLLGRLRRQNGVNLGSGACSERS 
\CSERRSRHCTPAWATKRDSVSKKTKCR 
RHQHVGSLTVRLRPGYWIRHCGCHWWP 


1345 


15246 


A 


1353 


249 


3 


CGATLCPKKYMKITPSFFVEIDN/AILK 
FIWKFKGPRIAKTTriK*KQG*RTHTFHF 
HNILQKAIIVKTMWY^YKHRQSPGRVAH 


1346 


15247 


A 


1354 


15 


416 


lltlsssi.kitihtnkgrt*sfireni i 
xfiattnllgllph* ftpttqlfinlam 
aipl* agavi igfrskiknalahslpqg 
tptplipilviietislliqpialaNar 

liTANITAGHLLMPIilGSATLTI 


1347 


15248 


A 


1355 


2 


416 


I KYLISrVRPETLKLL * / ENI E *NPHNIGIi 
GSDFFNLTSNSQGIKEKIDESDYPKLKS 
CCTESDTTNRIRQLKNERKCL.QITCDKG 
IjIF*KKLK*LY/KQKTNNH*K/WSKRLK 
YFSRtCDIHMAKRY/MKKCSTS I IRENKP 
K 


1348 


15249 


A 


1356 


426 


85 


HARLVL* FVF/LFETAYLSVAQA/GGAM 
AQSAHFSLELPGSSDTATSTSQVCYHRN 
T/RLLF* IFCGYG/RLCCLGWSRTPRLK 
OSSHLSLPKCWDYRWEPLYARPYIiSCFP 
ENARLG 


1349 


15250 


A 


1357 


2 


301 


GGLLEVKVQDQLGQHNKTPSL/ IKIYIL 
P I YTQKN * KNLTRHGGMCL *S QLIARLR 
* ED * S S PGI *GCRE P * WRHWTPVWTIQQ 

DSKS RGE KKQVI F THYT 


1350 


15251 


A 


1358 


2 


389 


FLHVGQSGCELPTSVDLPASASQNAGIT 
GVSHRSWDKFSLLDVPSPLTP/VSSAVP 
LIS YIiATGWRQAAI AASP I FLHQLARPA 
QPARE AAADS * LPADS AFP PKQWTC * * M 
FTGAIiFKKKKKPCFVAKK 


1351 


15252 


A 


1359 


309 


3 


KWDH I KLICNFCTAK * TINKVQRQPME W * 
K/ 1 FWYPVDKGL I TRI YKELKQLYRKK 
KSNNL IFKMSKS\SS LAIREMQ I KTTMR 
YHLTP VR I LVY VLP KRARS 


1352 


15253 


A 


1360 


300 


2 


KRAFH*KKMPSRTFTAREGKSMPCFK/A 
SGWATPVIPAL*EAKVGESLEPRSSRPA 
WATRRDSCLF*K* * INK*KLTLLLGANA 

AGNLTIiKLMLICHSKS 


1353 


15254 


A 


1361 


396 


60 


HKVTRKSDGMLCHSGFCSINQLAFCCLF 
FQPGGPRGTPLP/PYKPPPQKTPKKTGP 
GGGGLYSPPFGGGGRGNPFFPGAQGFF* 
PPSPPPPPPPGGRKKIPLPKKKKKSLLF 
FF 


1354 


15255 


A 


1362 


37 


385 


AIiFSFSFFFGFFWGKKIFFFGQGGRAGG 
HSNLPEPPPPGGGAPPRPNLSGGGE*RG 
PPIS\GEIFGPLKKTGVPPGGRGWFKPP 
APKEPPPPAPPRGGISGQDPLPPPVLPG 
WGKKN 


1355 


15256 


A 


1363 


12 


421 


EPWEPQTLGICTHLQTIaFHDYQVLMKML 
VVTVRYHIiTPVKMPFIRKTFDTAGM*KK 
K:KGCYQGSKKGGMLr/HPPWGAL*MSlV 
KKKTWGTFFKKIQKELPWDPAIPWLGMY 
PKEGNSVFQRGMGLPSVIGTLFPIAG 


1356 


15257 


A 


1364 


278 


3 


CCTGEKLETFLLR*ATRQGCPHI,FDNViyr 

evxanavryekeikgiqngkkkvktNsl 
fpgde iv/ yaenphks * pknsl / klksd 
cskvniqkpiaf 


1357 


15258 


A 


1365 


265 


330 


WPGAXAHACNP S TLGGRGER I T 


1358 


15259 


A 


1366 


379 


1 


KRKLMYCWWECK* IQPGWKTVWHFLKKL 
KbELP YDS VILLLYTFLKES / PVYARDI 
CTPKFIATLFI IA\RCSLTHD*TKKMWY 
LSIMEYYSAIKNE / YLPFATTQMNLEDN 
I FSEESQA*KHGVF I FP 


1359 


15260 


A 


1367 


1 


376 


HLPGAESQPPPV/DNSWDRPAGRTQLLW 
TPA/DPHSYG*GGAGPHPCPSQPGCCAP 
VQSCS * APSEAQSLGAADS \GPAATLPA 
RQLITKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKDDSGKG 


1360 


15261 


A 


1368 


2 


409 


EEAMPKAKMG*RPFAPNSQIIJSrAKKKLL 
KEIKSVTPVNI*MIRK*NSLIADMKEV* 
VIWIEDQTSHNIPLSQSQIQSKVRMLFS 
SMKAERREEASEEKLSVSR/GWKSAGFM 
RFKERSNLHNIKVQGEATSTDGEGA 


1361 


15262 


A 


1369 


220 


416 


PQPLFDWQMYTYTLHNDILVNDGRLSPG 
GQGCSELSLCHCTPI*MT\SETIiSQKKK 
KKMGQARGLS 


1362 


15263 


A 


1370 


310 


289 


AFR*AFHSKGSK* *H*EKKVN/WNFTKT 
*NFCTAICr>IIKRMKRQPTKWEVIPANHI 
YLTGILIS\KIYKEbRTTQ*PETQSLK 


1363 


15264 


A 


1371 


3 


322 


HASERTHRRGKSTETTTPAWATERGSVS 
KQKTNKKTNK/RNTGSSIHNMVSDFEKQ 
VTQTF * S SMQMSNKPLKRYLTSVr lEMN 
IKPEYHFTPIRMAI IKKTDNTKC 


1364 


15265 


A 


1372 


3 


332 


GKE VS * ED I MMLHVYAPNNRAS KHMKQN 
LIEMQGEI /DE/HPVI IGDFS/TPhS I X 
DRSGQKIIKDIV/ELS/STVKQLDLI/D 
I YRTLYLKRVE YVF\ FTSLS KTFATINC 
ILGHKV 


1365 


15266 


A 


1373 


381 


2 


CS VAQAGVKWCNHS S LQP * T PGVK* S S C 
FSLP SHSDYRHE \ PAL FFKFFVQMGS \ T 
ML*FS*ESGI KGSRQ P CS YT YLHS PP I F 
SSLFPSLAPESIKRQEPSFFFEMESCSV 
TQAGVQWRHLSSLQA 


1366 


15267 


A 


1374 


1 


385 


YKKG I NAKI FNKI * * V\ESAI SHDQVG I 
VLVLQGFIiS I /RNKS I S VTHHWRLKN/ 
HMLI * IDAEKAFDKIQQSSHGKIGIKGN 
FFDLLKS I YRKPiyiANVILNSEKMKCLPC 
KC\KTSQGCQLSPIjFFLPFF 


1367 


15268 


A 


1375 


1 


357 


CSGVISPHCGLKFLGSNDLPALASRVAG 
IIGMTP\HAQLIKiSrFCCCWWR*CriAFGG 
*RFKTMCRPIVK*NK:srNK*NRTVCMSG 
CGDS /RQLLGRLRWS PGLRISTVQGCSEP * 
WHHCTSACIi 


1368 


15269 


A 


1376 


1 


375 


HRPKFKSISIiLEENMGESPHDLGL/GSD 
IiLDMAP I VQS IRDKTSDFLE I KKS CSS K 
GTV* IMKKQATDWE /RTFVKHTSNKGS I 
MYEEL*WIi*KL*N]SrPr*KWAREPSRHL/ 
EDTQMATKHMK/KCSTAL 


1369 


15270 


A 


1377 


260 


400 


KRGLGKTVFPWPKKKNGPPFFPPKTTSK 
GRKTIiMVNPE*PKYYSVKKKKKKKGGGL 
PFPCKIFFPNLTIQTTFFWPKTNPLTLK 
KRBOCPPKKTPTFWGHFFFERGTKKTQGG 
KKNPFKKGGWEKIiFFLGQKKKMAPFFFF 
QKPPQR/WRKTLMVNPEP*NLLKKNM 


1370 


15271 


A 


1378 


403 


1 


TGVSQALSLFPFKYPKPRMVEFLVSPSR 
*KaPSVYGLQIPFSlPKVPFPCPFWAF/ 
SLV*SSPPCLSLHLLPAFGVLPiCRSIiPR 
PMtiQSFSPMFFARIG^FQILLLNFQSLL 
S*FFFFFGDRVSLCRPGWSAVAP 


1371 


15272 


A 


1379 


195 


381 


AERYQTSRSTADRISCLFI*SAEI/VFP 

DSGKKTGGNKmmWNlSn^ 

ISLSLKP 


1372 


15273 


A 


1380 


361 


39 


PPVKETAKDWRRFPKQPMTNST* SDA* 
VDEWC]:i*FQLIGRrjK*SRRIA*AQEFK 
AS /HGQHSETLSLKNYLKKKRFSTLLFL 
REMQIKTTTRYHFTPIRMAKK/SHHTKC 
*QGLAMIjPRIiALNSWAQAILLLHFSLPI 
SWNYRHTTSSTQASLHVrjFVIGCLGNRIi 
LTSFAVSFTGG 


1373 


15274 


A 


1381 


400 


90 


LPSSWDYRCAPPLLANFCVF/M* SQGFT 
RL VSNS *PQDPPTSASQSAGITGVSHHA 
WPAFFIMP7FSSS YSR/TMH* *QHPHV*K 
IiHICPlSr**KS\LSCPQVPTIIVV 


1374 


15275 


A 


1382 


1 


431 


DNI PKKSAVYYWI TLFKKG* DNVEDEAY 
SGTLAISIC/EENIHIiVCSIjSDEEQHST 
AQTIAWTIDITVGLAYTILTAKSKU^TKIj 
CT**MPKIiSYPLF*KIL*KNKTKWIV\P 
NQLQIKAELPVEILHNCDQDPETWLGGV 
AHTYN 


1375 


15276 


A 


1383 


2 


432 


elsadvsfffittpiisgvl*qnwgiaaf 
ipielrsptevtfsfdvgngpfeislqs 
pthfsdhqwhhvrvernmkeasii/qvdq 

LTPNTQPAPADGHVIjIjQLNSHLFVGGTA 
TTQRGFIiGCIRAL/RMNGMTLDLEERAQ 
VTPEVQ 


1376 


15277 


A 


1384 


1 


421 


NPPALASQRAGIADICHGAWPLLRLSKP 
QFSNL*SKKFALGDV*SSNIL*YIYIFF 

emesrsvtqagvqrhdlgslqpXlppkl 

SLLPPKLSIiLPPKXSASCLRIiAGSWNYiyi 
HVP PRPANFCI FSGDGVS PCWPGRTRTP 
G 


1377 


15278 


A 


1385 


2 


416 


IFSVDETALYWKMMPSRTWYL/RKEKSM 
SGFRDSEERLTLFL\GLNAPGELELEIL 
IiI*HPEILGPLNYVKFTLSVFY*WINEA 
LITAHMFTAWLTECFKPSVETYCLGEEM 
PFNILLLIGNAPGYPRAPMEIQEINI IV 


1378 


15279 


A 


1386 


170 


1 


DGVFLI FGGQNEKLNKNKDG/DLTKLPR 
LA*NSWARQSSCIiAFSKCWDYQREPPCL 

A 


1379 


15280 


A 


1387 


47 


418 


FWGFFFFFLKKKKAALGPWTPPPPGQGG 
PPPPPPRGPGKGGPNPPRQTIYGFWGQR 
GPPPGGGGEPGPPPPGEP\PAGPPQTGG 
PQNWP PAPGP * KGF * GASKGAP * KGVDP 
GGERSQVPNRGA 


1380 


15281 


A 


1388 


58 


503 


RPTRPGNYIMIKEPIFQKDRTI *NVYAP 
KNIAIiKYIKQKLINLKGKRDKIiTLXVAN 
I STOP * VTDITSR * KI CKGTEELF*QPA 
\DLIDIYRTLYPTAA*YTFFSSAYNYSK 

IGNSIGHKTFSN/ CKRNDNIQWIjFSDHN 
GIK*E1NIII 


1381 


15282 


A 


1389 


2 


401 


liVSQWNSKNRILKKIKWSTPVHT/RQM 
IRKQNNIiIADMETVIiWWIEDQTSHNI \ 
SQSMTQSK?^TLFNSVKAERGEEAA\EA 
GRGWFTRFKERRQL*N]:jQVESEPPS\AD 
VEAVAS YPEDVAKI IDKDGYTKQR 


1382 


15283 


A 


1390 


294 


3 


KI VI FDFNDVKlSrCS S * KI VIFDFNBVKN 
CSSKTVIEGKVSDCiCKIFSKHISDKYLI 

siiYKEh3mjjmsiQQPN*mAin:)ij/mM 

SSKHRKI CS I S FGKMQIKTQIRNHYI PT 
DAW 


1383 


15284 


A 


1391 


3 


422 


PLFKKEEGTNFPFPPPWGRFFPGGKSPS 
PFFFPPPPFIKGGPKVSQKGPRPFFFFF 
F*DGVSLCRPGWIIiFPPQVTEAIiIiRGFL 
AIjSSIiCFRLDKFFFFF*R*GHLIjSRWDy 
RHEPPHLRIS*RKTKE\KGLIMIiTRLVL 


1384 


15285 


A 


1392 


3 


400 


FLiYIYKDIYR*YKTPQINNKLKIIYFK:* 
EKDLSKLFTKDVQMTNMHI *MGSTSLI I 
KQMQIKTTMKHHSSLPECLHFNAANITC 
WKRLGPAGTL ILRW * ECKS VKPLi/WETF 
^QCLIQIiSMQENYH/DPATPIilG 


1385 


15286 


A 


1393 


409 


2 


IVRHFSKEDIHSTNEHMKKGFLSIiVKEV 
Q IETTM6HHHI TVRRVKI KTDI FASA\N 
TKC* *GYGKTRTLINS/YWECKI VQPL/ 
WKNILAVAYKIKHTFFPPRIHLSGSQKS 
HPS lYSKEMKTLRQHKDIPMATIiSQN 


1386 


15287 


A 


1394 


409 


24 


KSPFFFFEKGFPSPPPGGAQWGGFPPPE 
PPPPWK*FSPPTPPNKWGNRPPPSW/P 
G*LFFFCSPGGFPPIjPNWFLTPPLR*SY 
PPGLPKRWGFKQKPMGRG*RAFFCPPKI 
S KKLWSGKKKKKGRPLAI 


1387 


15288 


A 


1395 


83 


411 


QRDS VS KTKQKKINNKSRTDLNRHFTKE 
DRWM/ANKHLKRYST*LVIREVQIKTTM 
RYYYISIKKSKINKPDHTNCWQGCGRPG 
TLIYC*WE/CKMVPSLK*TFW^FLiKRLN 


1388 


15289 


A 


1396 


308 


14 


NFFFFYSWSNIIYCQKGTAFTPISDKID 
FW*KIISRQK*QYMLIKGSIKN/DITII 
NTYTSNNSSPKYIKQTLTGLKGEIAPST 
IWGVPNTTLSIIK 


1389 


15290 


A 


1397 


321 


1 


KQ*CSIEYS/FKYTR***SKL/WPGAVA 
HAYS P S TLGGRGGR I T* GQE FKT S LA 


1390 


15291 


A 


1398 


183 


48 


KWPGAVNHACNPSTLGGQGGRI /TliRSG 
V* DQHGQHGES PSLVGRVR 


1391 


15292 


A 


1399 


18 


422 


KAGMAILIDKVDFRTKNriTE*/EKHFI 
ITKGSVREENISVLNV/WPHYRASKHMK 
Q/ILIELHVELGEHTIIVG/DFNISFSV 
LNRIDKESARRR*NI *NNTNHQLNIjVDI 
Y* KTKNTTNGRTTFFPSAHKLFTI I IH I 


1392 


15293 


A 


1400 


28 


339 


YSCDHNSVQPQTPGLKQSLCQPPE*IAG 
ITDARHHIWPIFF\LKRSFVFIAQAGTQ 
WRGLIj*KKKKVGGFKKGSCLTLPGRGDY 
RRFREEKILIPGRGFCNELI 


1393 


15294 


A 


1401 


390 


1 


fglgkhvlhmtskaqsxkektd*ldflk 

I BCNFISFFS KDTIKGMKRQARNGEKX FPRH 
QS /DDFP* WAIDIiTFPKEDKVMANKHI K 
RCrjRLLLL\SMRYHYH*NKKTDHTKC*R 
ERG\ELELLCCW*EyKMVQ 


1394 


15295 


A 


1402 


149 


402 


rfwihlyrkrvfppllnpvitwkekrgf 

VFVFVF*DRVLLCHPGWSAVPWS *LTAA 
STSWA\K*SSHLSLPSY*DHQHASPCLC 
N 


1395 


15296 


A 


1403 


1 


403 


KRKSVPELIC/APRC*APCSYRPNLPSS 
PGPHLDPVGPTPTPSPPRPGPARGPPAA 
RPKEPAEPGKEERRGL\PAPGG 


1396 


15297 


A 


1404 


342 


3 


KEDNLIDKGGSPKQQIFKVN*TAIiY/YT 
ALYSKKIPPRTLIAGKEKLMPGFKASKN 
R/TLLLGANAAG\DFKLRPMLIDHSEL* 
/RPKSTLPVL*KKA*IjRAHIiFIALFTKH 
FKPTVE 


1397 


15298 


A 


1405 


498 


66 


knyakstlpml*kwnskalmtahiiptaw 
ftkNffmptvetycsekkrp/sfkmlll 
idnahshpralmeiykeiniafmsakta 
stlkpmdq\kvilaykssylrni \fhkt 
taals sdssdasgqskfk* i wkgftrtr 
grtrgsar 


1398 


15299 


A 


1406 


3 


411 


VSPLCLGRSQTPEIK*SSPLGIiPKCVDY 
RHEPLYLASMGVLNCP ISNLGQC IPP * A 
AWWSIiLLGCXPILMIiNLVQTPDLPSVLQ 
PRAPGPR\HPPVSAS * VAGTLGTHHYAW 
LSMLLGSTLi*DVFPFIiSPSLIPSEE 


1399 


15300 


A 


1407 


361 


2 


IRVPQKKKI KS PGRGHKI FS F * RAGPPP 
FFSFFFLRQKESHSVTQAREQWHDHGSL 
QSRLPGPK* FSH\PPPPHPPVTGTTPLF 
lYYYYYFLRQSL/DSVALAGVQWHDLGS 

LHPRVRPRV 


1400 


15301 


A 


1408 


393 


2 


PGFNISVIiKKASGGLFFFSPLGKKGFFS 
QPFFFGSPRVFPPPPFFNPPPFFFFCCP 
L*KIFFSPPPGLKFFFFKRAPPFFFFFF 

fffpdrvslXccpgwsamvqsrfsatst 

LRAQAIL 


1401 


15302 


A 


1409 


393 


60 


LNVNPNL * S YLD KHRGMLYH I LQS KEFL 
SKTSKAQAKEVTLCKWDYNKLKMFCTAK 
KTL)/lKVKR*STE*KKIFVKY/STrK/G 
* T YR I FWKIiNNKNWTQFKNRAETSRSKM 
AE 


1402 


15303 


A 


1410 


422 


2 


VNDRKMDGWMNGWKEGR^TQKKKGGRRK 
REKERSMFLEGRKEGREKEREEREKESQ 
KERQKRGKARKEQRQEARKQGGR/EREK 
ERERERGREREGERGRKSSS*TGSKWK 
I ILLAVKMDVHLKSIiHLQKVCVFSFSRS 


1403 


15304 


A 


1411 


398 


64 


PGFFFKKIFVENFFGPHFQFFFPPPPKG 
KNPL* PPQRGPG* XRFPGVFPPLKKGSK 
KKFPPPPRGTQPRWGKKPLLPKRPGFPF 
GPRVS FFCL I E IGWFFWAQVP KKKKKK 


1404 


15305 


A 


1412 


391 


125 


EPAPFFFFPPPGKRGFFP\PP*FGYPPG 
FSPPPFLKTPPGNFFLGPKKKKFFFPPP 
GEKICFIi*RAPPHFPFFFFFFFWPPRQA 
ICFIIE 


1405 


15306 


A 


1413 


3 


387 


TPDLK*LTRLSLPK*WDYRRATTPG\LP 
PFKPPASLGSISM*yHSPVCLASKLRIL 
PDSSFSPSATAKHPLTSAPAAAWLSPFA 
LIQPHSSESSLYSSYCPVPILLF/IYLF 
IFLRQSL/DSVAQAEVQWRD 


1406 


15307 


A 


1414 


409 


1 


KRSGPFFYF/SPPFKKGSFAKPFFLGSP 
VFPPPPSSPP\GFFPFSPPPKRGPFPNP 
PFWGAPGFFPPPI.F*NPPPDFFFWGPKK 
KKNFPPPRGKKFPFFKGPPPLFFPFFFF 
FFFFFFFFF YKI I S I KGE IGRSS WR 


1407 


15308 


A 


1415 


3 


385 


HRPPPVMGLS PRPAAAEGVPGPPAVIiAH 
QHHAQFIiARP*LPPHGAGLRTCSPPCLS 
LPNPPPWAPVWPEPP**ALPPAP/AVPG 

PIN 


1408 


15309 


A 


1416 


2 


238 


PGGRGCSELL,PRHCTPAWVT\SKTPSQK 
RKKREKPGSL*EKTLFFCQTLVFGPKKK 
NGPPLLGRVWAKRKGGFHPPLKN 


1409 


15310 


A 


1417 


3 


394 


NKHLKRCSTSSVIREMQI KTTNRCHFSE 
*KHW/IINMCW*ECGQTELSALLIRC*\ 
WDSEE4VQPLWERKEYLKEITAESSWDPA 
I PLLD I \ YPKRNEAS C 


1410 


15311 


A 


1418 


307 


357 


LETRKTAEKIKRRAGFEK*NLETNKNGN 
TISQNLRDVAKEVRREKFIAMNA\*SKK 
KEKP * INNLMLSLKEIiEKDEQTKPKVRR 
KKKILNIRAEI IRD * KNSGKD * KKSWP 


1411 


15312 


A 


1419 


341 


3 


I PVFliVGRGFPPLTRMVL I FWPHDPPPL 
PP*VIEGL,FH/LPAPRPGAFLRKMVPPY 
*KNKIPPPRKNTFFFLKPPPPFFFF*DR 
VLLCHPGWRAVAQSWLATASTSQAPAIL 
PPR 


1412 


15313 


A 


1420 


405 


1 


WGEELVPSGRVHEIiFSSALRSERDTDSV 
CSCGGQCHCPGAI PGAALKAACRGDPAS 
RVGVETGSTMGNNGFFSFFIFKPYFFRD 
RVLPCHA\AMAQS * LTAASTS * \ VKS S F 
CliSLRS S WD YRCVS PHLiANFKTF 


1413 


15314 


A 


1421 


1 


399 


QPTLLTELGLRPVPPHPATMNIIiY**IN 
T I LH YS YICNYFLFIjFLRQGLQ PLQAGVQ 
WQNHGS LQL* TPGVS FLS S WDHPNPANF 
FIIFFLERQGLTLLPRQVLNS *AQ/CNT 
PASAFQSAGI IGMSHCSWPYKN 


1414 


15315 


A 


1422 


168 


414 


EWREKAGESLEPG\GGGCSELRLCHCTP 
AWVTEQDSSYKKKKKTPGGGKGVF/C* K 
LGGGINF* KRVFCFGKGPTKKKTGGEA 


1415 


15316 


A 


1423 


369 


1 


EHFCKSQDRDSVPKKKYIYIYPMSVFMP 
INLTNW I KFLERQKLS * Q I EE AI ENLFS 
I Y* / PNESEFI I * SLPITKAPGPYGFTE 
FYYS FKGEAMQ IIiHKLFQKRG* RHSFSN 
VIKTLKSKPEK 


1416 


15317 


A 


1424 


403 


1 


KNTEHSKRYSIPQRYCGFSSREFFHFLV 
HMKVMFPLFKCAIALCLKQVYTVFFETE 
SCSVTR*/EVQQQDHGSQQPQPPRFQ\H 
PPTSASQIAETTAILSNFRKICIYFLRQ 
SIi/HSATQA*VQWRNHGSPQAPPPG 


1417 


15318 


A 


1425 


401 


3 


PFFFFKKNFPFFFQVEGKGRKLG* QPFP 
PG\LKNFSAPPLPGGGN*R\RPPFARLF 
FFFL*KRGFSPIiNQGGLGSPPFFSPP\R 
APKNVGFQGITPPPGNFFFFFFFSEAES 
RSVTQAGVQWREPGS \QPLPPGF 


1418 


15319 


A 


1426 


386 


27 


CRrNGVLILC*CECKMAQTIi* I * FGNFL 
QKKLDLTCDSTI PLLGI YPRDMKTY\7H\ 
KKTCT*MFTAALLI ITKSWEKLLRPSEG 
EWXNSRHPYDRIVLSNYETHWVWKIQRH 
YAEAKKTV 


1419 


15320 


A 


1427 


398 


2 


GLQVHATTKFFKFFVETGSCYIAQAGLE 
LLVSSNPPG/SSFPKCWDYRCETRRWPF 
jCiFjK:FSi:iPIFVF*IjWHLFCLHLLPLFILK 
LNIiPCFMYYLFIPPALYSLSLLALFRND 
CDLSPIHLEVKKYFSKKDRL 


1420 


15321 


A 


1428 


401 


70 


HHAQLRCFFNSFVEVGVSLCCPGWSQTP 
GLKRSSHLFFSKCWDYKR/AAITLPSPF 
* IALF*ENPPNCHSLPTPIiSPPKLAALG 
NSCLHPWLPQPPVKGLSDSFVCMHPGS 


1421 


15322 


A 


1429 


388 


131 


RGGGCIiQRHXRGFKQENWFNPGGRGSS* 
PRSMYCPPAWGAQGGSLSQKKKKKKKKS 

KNYQKILSTQKTQKSRVFIWSKHCTRH 

CEK 


1422 


15323 


A 


1430 


184 


413 


VSFIiFDFYFIiFF*RW/RSHSVTQAAVQ* 
CDCGSLQPPILFYFLRQSL/NSVAQAG/ 
VGSLQPLPPGPKQFSCLSLLSIWD 


1423 


15324 


A 


1431 


76 


533 


SCKRTTGRQFPDLPTRPPTRPQGFALVA 
HAGG*WRDLRSLQPPPPGFKRFSCISSV 
LWCTKAFNFDFQE X YSFSS * I * GFDSFE 
VNFFFIiEMGVSL\FARAGIiKLLGSGKLP 
ASASE /SGGIIGVCHWAQLL 


1424 


15325 


A 


1432 


373 


29 


rqgfppmgrwldslkkkkfqipnskif 
pgggaprjvippsregeagkff*p\ggkg 
aikqnyc/hcpptwgkegnfvskkkkkk 
ekkmhqk*rcvk:dqq*t*mllfpslksi 

YYFK 


1425 


15326 


A 


1433 


389 


73 


TDCPAIGRmiRDPVRPLLSSPHRKI / YA 
NLYYYIIVRILTLIQSTDLM* ISPWLV 
LTRVCVCVCVCVCVCI ILPS /CYPVCRF 
MYPTTTVKVQNIS IGQEKQKTKT 


1426 


15327 


A 


1434 


402 


39 


QAGLQLLiAS SDLPALiASQS AG I TGVSHC 
ASPSrSL*APLGLDTFSDFPCFISIDLDSF 
EEHWSGML/SECP*TGI*DLSFFS*VYW 
GYGFLEDHRGKVPFFITSYQSYTPPTQL 
IIABVKFEQLV 


1427 


15328 


A 


1435 


105 


387 


EFSKVAGYKINTYKSIVILYNSNE^FEN 
\KKIIPFI1ASKRIKYLKI/QFAKAVQD 
FYAENYKTW*RI IKEDLNK*KNVLC*WI 

GRIi/NI /LKMVLLP 


1428 


15329 


A 


1436 


242 


2 


FGGGFFWVGTPNKMGPFPQI IKNLF/ TR 
PPENP*KPP/LPFIAFFSLGFLGVFFFF 
*DIVSIjCCPGWSA*AQSLLAAALTSPG 


1429 


15330 


A 


1437 


411 


0 


LLFVAQECLKWFSALGGPGVPCS \LPAISI 

qqpsvwvp*qs\ehvtvlqrfclwpvlg 
paygqiwepvpstl.k/ppppgfk*fs * v 
sllsawdcrctp / ptdpai^tfci fsrdav 


1430 


15331 


A 


1438 


41 


412 


FFCTD*GSLCCSSWTQTSGLKRSFHLSF 
PNGWNYGH/RASVLGLIFPFFLRKKT/C 
P * KKFL*NEDLLCCQLGLEIjLAPRDS PA 
LGPQS VGTANVTPRPW/ IiLFL * TQNYIiN 
SSRVRFSPAPLKYKEV 


1431 


15332 


A 


1439 


1 


406 


HAYNIV\NPSPVSLTQALSSLLMTYGLT
MGCHFIiSITLLlLGLLTNTIiTIYH*WRD 

vtrestyqghhtppgq*gprygiilfit 
sqvi fftrff*afyhsslsptpqlirrhw 
pptgitplnplevpllntcvllr 


1432 


15333 


A 


1440 


3 


422 


mekvswwidqtshniplsqsiiiqnsal 
slfssmkte*geeaaeekfeasrawlmr 
/feerchhnvkv-qgVeaasgdreatgsy 

PDDLAKI IDDSGYPKLHI FNIC* KPTDT 

divdetalc*kkpstyflffflkqslal 
pp 


1433 


15334 


A 


1441 


375 


2 


kvsspraenvilyncpplffflcqd/ch 

S VTQAARQWRHHS S LQP * TGL \ AS QVAG 
TAGAHLYPSQNLIGRGIRSLA/SVAQTG 
VQWRNIGSWQPLPVRIKGFSSLSLKSSW 
EYRCPPPCPD\LFIFL 


1434 


15335 


A 


1442 


397 


122 


GQASLEIiPTSGDPPASASQSVRITGMSH
CARPILGSCISLPAYPFCFLKCFLF/CC 

REVI LL * CSGWS * TPGLKQS SGLSLPKC 
WDYRCEPP 


1435 


15336 


A 


1443 


3 


299 


WEKMFAMrLSD * AT VSRPYR* IjIiQLNKT 
KYKLKIiSQS/MNRHPTRGDVAMANKHTK 
RKSTSLVIREIKTMRYHYLLKRMAEVQK 
/SVKCWQECG*GGGKCG 


1436 


15337 


A 


1444 


10 


410 


CTLFIiS ISPDSAGGICTMIiLFQNTLNFK 
ISIiALGVSDKCFSNFLIMTIVRNVFYIH 
GKKCILYCSPGCAHTHMHTLMHTHIHLK 
QFPLRRGLTLLPRIiECSGAITAHCSLDL 
LGSDSPP**EMYFIYMV\EMYFIIiPPRV 
CTHTYAHTDAHTHTPET I FFETGSHS VA 
QAGVQWRNYCSLQPRPPGLRFSSYLSLP 
GS*WHRHVPP 


1437 


15338 


A 


1445 


431 


9 


GPAQSLDSSVSPLGTISSTVKWRHNAYL 
AGLSGGI /PCDCFLGPRVPWGSGICQWH 
WAVSPTLS\PTVRPPLFRETLYFr*V*V 
PPLETCPQ*HISHCRCVINWCEDTHTPK 
KHLVGVHACVCVCVCVCVCVCLGRSRQS 
EH 


1438 


15339 


A 


1446 


85 


416 


PGCLSLPKIWDYRCEPPRPPHFFIFSS I 
QGPLTDFFLTPLEQVQISPTAKGFLKKI 
PFFFF* *RRGVP IIoARWFKS * PHDLAA 
S AS QKFGI TGVS PHVHP I FLY/ CSS I QG 
PLTDFFLTPLEQVQISPTAKGFLKKIPF 
FFFLRQGLT\RQWSTHLGLPKCWD*GCE 
P/PMPGLHCEFFKG 


1439 


15340 


A 


1447 


376 


3 


I KSE 1 1 ETESRKP I EKIS / RAQS CFCGM 
VYE I D KLIiARLDRKKMKTQLNN I RN * RD 
EINTDFTDIKRI IRENYKQDYGTKF3SINL 
DETD I FLERHQLPKIiTQEEKDNXiNSP I T 
IKDIEIIV*NLSTG 


1440 


15341 


A 


1448 


2 


192 


SKWIKNLNKELKP*DS*KKPEGNLHDIR 
FGNDFLDVTPKN/MATKGKIDN*TLPKF 
KMLMRIWRN 


1441 


15342 


A 


1449 


411 


3 


VFIiPPLPPFCFLNFF*GKGGFFFKrFF* 
KKKGVEGPQLGPPPPFGPFKKFFFF*IP 
LPCAPLFVPPTLKPLF/CPPKKKNWENNP 
PPFFFCFFVKKTQFYFFFFFFLRNRVLL 
CHPRWS VMV* S CLAVASTS WAQAI CP 


1442 


15343 


A 


1450 


413 


1 


EA*TGEWCNPGGGACHEPR* PPCPPPWA 
TERGFVSKKKRKQEK/SRIMECFQDNLP 
GPFQIFSVMKNKQEGR*FWTK/VKMTKY 

NA*imTGS * YWKKRENEEECYDWKKLF 

QS JLS * YVKKAR* KPGCWLMAAIPALW 


1443 


15344 


A 


1451 


4 


384 


DPAIPLLGTYPREMKTYVHIKTCTQMFA 
AVLFTVAETSKRPKCPTDE / VNKI * CTY 
I/MTYYSAMKRDE 


1444 


15345 


A 


1452 


3 


390 


IiPDHPGSSVSTPRGVIITGRGFWLLFPW 
VS FFFFFPFLiKRISLLPPNWRGGGQNLL 
NKSPPPRGP*NPIA* PPQGGRKKGPPHP 
PGAPGDKNP/QPFFFFGEKKKIFNPPTG 
GEKKKPPSPFKWGGGGPN 


1445 


15346 


A 


1453 


37 


381 


LILYINVCVCVCVYIYTHTQMGSHIVAE 
AGVQWCNLGMW*P*TPGIiK* /FLLSSGD 
YRIiTLPHLANLYIFFPFFEQRG\FFFLL 
KbV*NSGPKATLPRVGITGLTHTPRPYR
GDLFF 


1446 


15347 


A 


1454 


3 


400 


TEMVLHTEMVLHISVWDPLLDRPGRGVT 
WLMLTAELFLYGGTKAFSSSFDV*GPCS 
ETVECPSD/L*A*GPP*NT*SPiyRFAS 
PLIiAFCIYLLETRSP\SVTQSGGK/WQD 
LTSLQPIiPPRLK*SSCLSLPSSWE 


1447 


15348 


A 


1455 


77 


398 


RAEIVPLYSSLGNRVRPPSQKMYIYYEQ 
LHAHTFGNIaAEMDKLVERYKVLSKFT*E 
El\liNSPVSIKEREF\AVISLPQKETLG 
PDSFTGGFYIFKEELMPILQRLFW 


1448 


15349 


A 


1456 


397 


69 


CPPGLSGLPWVA*AVPP/RRPAPPPPPP 
AGIDAGLGDDPLQQTHHA/PPAAAGSAP 
AGCAAGPARGAPRGRSPPRRGSAGAAPP 
PWPPAAAAASSAGGSSAPCASSPAAPPT 


1449 


15350 


A 


1457 


400 


2 


ALFFWAPKKKKNFS PPPGKKFFFFKGPP 
PLiFFFFFFPFFFFFFFFFFFFFPFFFFF 
FP*RRPIFILFYFLNQKNIPRDFFFFFF 
FLRDRVSFCQPGWTAMV*SQLTAA\ *TP 
GLKQSSHL,SLPSS*DYRWVPPC 


1450 


15351 


A 


1458 


343 


11 


QGVHSRKYKEISKLNS*KKNPTRKWAKD 
TNRHFTKDNIQMENKNMKDVQARCSGS * 
F * KK *NFKS YAL * PKCSEI / KKSVTG1N 
WRNS Q I YDKLS TLQNN* WI FSRSLYNFE 


1451 


15352 


A 


1459 


109 


401 


QNYRQLGQWDRIESTVIDPCKYGKLIFD 
QSAKATEWRKDS *NHQTS/WWKESFTRV 
KAIiEEITKRKVNSLASISVKTDVHVIKV 
*LKKKKKKRGGRFKE 


1452 


15353 


A 


1460 


42 


424 


CPAN*NSFSRDR/SLPMIiPRIiILNSWPQ 

WLL/PWPPKVMGLQ 


1453 


15354 


A 


1461 


419 


69 


I FPLKKKKRGGGVGPPLYPPPLGAKRGG 
S P * KQNLKP PRPHKENS PFF YKKKKKGG 
RGGAPFFPPPLGGKTKKFFLPPK/IKVS 
FHPNLFPPPPPPGGKKKFFFPKKKKKKK 
SRNTVWF 


1454 


15355 


A 


1462 


1 


277 


CHTNFLCMS FPPAI SHHPLMP PHPVSRS 
VAQAGV* WCDLGRG/CSELRSCHCTPAWr 
MTERDSISQLIRKKKKLIKIKKKKNKNK 
SVIFLGGGS 


1455 


15356 


A 


1463 


382 


3 


kaggsinqnsppppppgekkkkpppkkk 
kkkknpfwggprilphnpppfbr*r*ki 
flgpkfltplgpkikplfffflkkkkkk 
rmty/hsvaqgpllnkdtlqagXlska* 
rs ppks ksmgqsfhrn 


1456 


15357 


A 


1464 


561 


86 


xtdpilslkaektfgkiqdsflivssslx 

NKPGIEGNIiLS**KASTKQNPIiyiNLRV\ 
LNVFPLRS*TRQVCLLSPIjIiFNIVLEIL 
AHIISQEKEIKKIQI*YKEEKLPIiFTSS 
SLFI *VKNLMEPAKKLLELINEYNKVER 
YKINIKNILIAKNTWTLKP 


1457 


15358 


A 


1465 


3 


221 


RFHRVSQDGLDLLTS * STRLGLPKC / WD 
YRIiEPPCPAKTCIiILNGSC*VFLCyLSC 
FKAQERPSQNSWGAFVTL 


1458 


15359 


A 


1466 


2 


396 


WWPAWTHTPGLKQTSRFSLPECWDYRRE 
P P \ PGLVKFLL/ IQYRWTQHVGLGDWV 
QDHHPHIE I CLHKS VI *NGIVFAYNLRT 
SSIiTLFYLRRGLF/SAPQAGVQRHKLSIi 
LEPLPPSFKRSS\CSSLLSSWDYR 


1459 


15360 


A 


1467 


349 


413 


RLGL* PRKD ITD/HVSLRKDTG/ WPGAV
AHACNHSTMAGHRRQIPRSGV*GQPGQ 


1460 


15361 


A 


1468 


403 


2 


riPVEWKGGIFGSFHPPPLRIiKKFFSPTP 
PKS*D/YRGGPPPPGyFJFFFFLKRGFSP 
FGRVIFKFPPPGVPPPPPPQNFGFKGRS 
/HPSRALFFFFKRVFF* lYPFFFFFEME 
PRSVPQAGVQWHDLGSIiQALPPRFM 


1461 


15362 


A 


1469 


423 


3 


lEFSPFAPSKDKGAILGPSTIiPLPRFITN 
FFCPPPPINKDIRR6PPPRPNPFFFRKR 
GGFPHWAGMFLSFNLQEGPPPSPPQ/SV 
GF*GRN/HPPRAQFFFFKKFFF*CyPFF 
FFFEMEPRSVPQAGVQWHDLGSLQALPP 
RFM 


1462 


15363 


A 


1470 


13 


427 


RTRGLVFDKTEFKPPKIKKKKKA\GPFL 
RGNRSILKKKATFPKNLFAPNPGPPKFI 
KKVLSDLQKNLNPPPRMGGNFTPSLLK* 
NKSSKQQITRDFRDLTSPLDQGDL*KIY 
KTFYPKTTEYTFSSAPHGFY/S * FDHKI 


1463 


15364 


A 


1471 


378 


1 


FVRPPFFFFSSSRPFFKWGGPIiPPAPQ 
FPFKTPRGISTPLLRG* KPPPQTPVGGAKG 
SPPW/VPPGFPKKGGGVPFFQQGL*KIS 
PPGPAPP *riVWGGESPPFKKKKSRPGW 
AHACNIiALWEAKAG 


1464 


15365 


A 


1472 


412 


53 


SRLSFFIiSSEPHGPPSPLGPFATNKVAF 
C*/PPPPF*PSPPPKFFFF*GPKSVIYF 
YQR*PPTSRIFPPKGGAGPP/PPFWGFV 
NPKKK\LKPFSKPSPPAAKFKNPEGTKL 
GFFKKKKKKL 


1465 


15366 


A 


1473 


116 


34 


DWNIKPVLISNVMTGLMSMIITLLLQLF 
LIi*LFFFK/WELTFFGGKFHQRVr>PSFQ 
GSLGRGRQFFLLTKTEYHIFFLIKVFNS 
KTEGAGIiAT*IK 


1466 


15367 


A 


1474 


42 


428 


EIIiyiESINRFDVITF*NFCL*CYLMSKI 
NRQASSW/ENV*NQYVGEKILIFLIYKE 
CIQINKK\KLRPKIDTQVKDKQISSEGM 
QMANQHMIQC\QPSLVLNKMQIEIAEGH 
HLPYQINKDSKMMVEERRQ 


1467 


15368 


A 


1475 


87 


433 


PQSSPFSDHCSYQRIiFIiHLVKVSTYRLQ 
KIRKIHKSPGIilNEYFGGFFLFFSFLFFF 
ETGFNFVPQAGVQGQDLG*LQPLPLiGFK 
GFS/CPQPPRTFPFFFFLRVWGGFIQKK 
KFLFV 


1468 


15369 


A 


1476 


212 


430 


SLILMTSNGIHFIYFLFLLiLLFFFF^DR 
VSACHPGWRAMS* S YFT*ALTSQVK\QS 
YIiSLSSS *NYRHTAIMP 


1469 


15370 


A 


1477 


493 


2 


PGAAAHACNPSTLGSQGGWITRSKIPEH 

PG\TLWNPRSY*KTKMGGLEAP 


1470 


15371 


A 


1478 


454 


492 


HRVGEERF/ CLFETESHSLiTQDRVQWHD 
LGSIjQPPPPRFKQFSWYHI*PHAW 


1471 


15372 


A 


1479 


2 


213 


IDQERERIiMSGDRERDTETDAEKDMGRE 
/RNRyRERERLRG\RRERRKRRDRMT*M 
PREREREWLNSLYRETYRDFETEWVMDR 
ERQRLKRRI.Ij*AVIVPSHSSLGSRRKTP 

fqk:kr*regerksefliqrdi 


1472 


15373 


A 


1480 


77 


453 


SFGDSLTIiSPRLAVQWVYLGSL*PPPPE 
LK\YSPTSASQVIHYLLFFFFPFGKKVS 
FCPQGGGEGPPFGLLEIFAPGLMPFFCL 
NPPKGWVWR/RPPTMPKLFFVFFIKRGF 
SPGEPRGVSFPEPGT


1473 


15374 


A 


1481 


398 


96 


KRPWCFFSPPPKKGFFPPLIFVGPRFF 
SPPPVFK/SPPPFLPFFPP*KKKLFSPP 
PRKIIFF*KPPPPFPFFFFFFFFFFFFF 
FFFFFFFFFFFFVFILLLS 


1474 


15375 


A 


1482 


484 


64 


QKRQMANKYMKICSPLFVIREMQTNTTI 
RYDDIPTRVAKNKTNK*KSNSTKyCQEC 
ESARP* lYCWWKYKMVQPBWK/S/VWQY 
IiIiKLNI *LP YDPVI PLLSMYPSQIKTCI 
PTKTCAQMYIiVTLFI ITKNQKLAHSARV 
G 


1475 


15376 


A 


1483 


2 


400 


YNKKGQGVAPRYDVDTAANFPKELAKI I 
DEGGYTKQQSFNIDYTALYWKKTPCKSY 
SWR* lH/SIjASKDRLT\r,LVGNWGDFK 
LKPVLIHCSKNPRVPENYTKSTLPVIiCK 
WNSKAWITAHLFTAWVAEYFEPT 


1476 


15377 


A 


1484 


459 


65 


GGPPPPHKTIDFFFFFFFR6ESPPPPKK 
KKKKKKG*DNAEDEAQS*RPPPSI*EKI 
MHLICALTEEY* *IjTAEIIASNIHISTG 
SAYIILTEMLELSKI,ST*WVPKLFYPNQ 

LRT/RAEL*MVILNK*DQQF 


1477 


15378 


A 


1485 


2 


318 


PPPPPQRFGLRGGAPPPPKKKG*DNAED 
EAQS*RPHPSI*EKIMHIjICALTEEY** 
LTAE I lANNIHI STGSAYI ILTEMLELS 
KLST*WVPKriFYPWQLRT/RAEIiSMVIIj 
NK*TSDP 


1478 


15379 


A 


1486 


122 


501 


PRDPPAIALQNAGITGLKQSYFDSLLSS 
WDYKHKPPCAPGYLK\VFFLEN*YLAIjY 
VF**SYQGTSVLTQPPSLIPRHSSPGVS 
VQLFQKK/DLQHGCIiLTPS ISGYSVTWD 
GVQWRDHGSL* P * PPRF 


1479 


15380 


A 


1487 


491 


425 


IPKNDI *AEP *RINKR/ CAKRKMVRKGF 
EAESTICKAIiRYEEIVCSRNTEFVHGWL 
*ENRGQIMKGFGNSVEKLEHN*KFYEG* 
LLYRFE * LAAGF I VFQQ * VS KE I VKAWP 
GT VAHACNP S TLGGRGGW I TRS GDRDHP 
G*HA 


1480 


15381 


A 


1488 


441 


17 


KKTOI YDQL I FNKGAKSTKLRKNSLFNK 
WCQDKrS/IIQKMKVDPYLISjMr\NLKW 
LKDLNVTAKTIKLLKENTGAILHDLGFC 
NAFLDGKPKAQTTKKKQVK*TSSKFKSR 
CQ WL I P r ISTL WETEAGGS PELRSS KPA 
WPT 


1481 


15382 


A 


1489 


413 


1 


IiFWKEMVFLPWPGRVGNPGPK\GWLRLR 
PPPF* PSFQPFWGPPVGLLFPKNPYY* P 
PGGAHGPNPPPPGGKG*GPPRVGGGKPN 
* PKFGPCPPGWATKQKPVFQKKKKPNKI 
KNRTVKVPKVTKFIYYVPYLNGVSN 


1482 


15383 


A 


1490 


359 


407 


RHRMITFFT* IVTQNFPYL*KKMDIQIQ 
ED*RTPN/RDQHKHTPRHIIIKLSNSQ\ 
NSKRILKTRKK*LVVY*GTHIRLSTYPW 
PKS 


1483 


15384 


A 


1491 


397 


39 


RKFRGRSFAKTIiFSGPGQG/PKGNPGIiI 
PEGPPPFFPWPAEDGGFFFPPGQLPPQG 
PIRGPGPFLGFPVPPPEGEGPPPTGP*K 
PGQKRPKKRGWSPQKKGPKKKTIiFFFFF 
CQMESRSVT 


1484 


15385 


A 


1492 


378 


1 


FGVFWFIiAPRRKGGFPPSHLIWVPPGFS 
PPPGV/SNPGPGIKFGGPIKKIFPCPPR
GGKFGPFKGAPPFFFFFFPQDGVCSVAQ 

AGVLQ*AEIVP\FPPT*ATEQDPVSKKK 

PTRPPTRPPTRPPTRP 


1485 


15386 


A 


1493 


3 


428 


IKSP*PDRFIAEFYQTFKELVPIIiLKRF 
HKI /EEGTLPS * F YE /ACVTLI PKPGKD 
TTK 


1486 


15387 


A 


1494 


3 


417 


IIiNNARLKPFLLR*TTRQGIL.IiIiSLLFN 
KI*VIiEFIiAGAISQEKEIKVIQVRNEEM 
SKTLFSQTT*S*GRNPFIKIPliPKMSW 
N/NFI KVTEYKVS IH 


1487 


15388 


A 


1495 


429 


4 


PFCLGSKRFPPPFNPPRGGKF*KKKKIF 
GPPRGGPPLFPPIiWAPKGGGPPRAGGSG 
PPPPKGGNPLFINPKKN/PPPPGGAIFQ 
SRFLGGVNPKIFLFPGGKFLTDPGFPPS 
LPPGGKKKKPRFQKKKKRKKRKVKEKKY 
FL 


1488 


15389 


A 


1496 


3 


433 


FSKEDTPMVNKHMKD/CCTSLVIRSVQI 
KTTTRYHLMPTRMAMCIYIFNYILFLK/ 
3 KNNKC * * RYRE I GTLIH / AQ WKYKMLQ 
LLW / KT I WQLLRMLNTKLS KDQE \ I PLL 
GI *KKKKKKKKKRGGRFKGINFTDPGVE 
RINFYNSAPK 


1489 


15390 


A 


1497 


3 


326 


WPACL * AVAAVALLVPEATRLTMGNNLT 
VCTPHSIAELLSSKG/DLWLTDNR/LLK 
YQALLLE\DLQLRTFTCLNPATF/VPEE 
TGEPEHDCEWWVQTGKRNNKDHCIiYSL 


1490 


15391 


A 


1498 


345 


22 


KSKWIjIDLHGKCKTLKR*yRKPTK*\LG 
HGNDFLDTTSNAWFI/RKIDKLDFIINK 
NVCSGKVTVKRMKR * TTDRDKI S S KG I S 
DKRLLCKIYIKILTTQQ 


1491 


15392 


A 


1499 


194 


432 


PWCVCVCVYVCMCVFETQS /HVARAGM 
Q*HNHSSLQIi*TPGLKQVSCLNLSSSWD 
YRHTPN* FFFF/ CNFYIiERGGVS 


1492 


15393 


A 


1500 


2 


417 


RD * FMRFK \ EKS YS CNIKI QREAASAOT 
ETiyiAS Y * EDLiAK I INEGS YTK\ SQ I FNV 
DEVAL\KKMPTRTF I VRERKS I PGFKAS 
KDRLILFLG/ANAPGNIKWKPVLIYHST 
NPRAFKKfDAKSILLVLYKLNSKAWRIAH 
LF 


1493 


15394 


A 


1501 


414 


0 


SSSSSSSSSSSPKWTTGALQP/IiLSRTP 
LKDSSEEESSQ*AEIi*AVHLWHFAWKE 
KWPDM*IiYTDSIiAVASGLAGWSGT\WKK 
HDWKI CDK/DWGRGMWMNLS 


1494 


15395 


A 


1502 


3 


125 


RLGLPKC *DYR/RAATTPGLH* F * S WKE 
QRPQ*IiALGRKPVNRNKNTQKYIHrYTY 
TH/P I PTNVYICI YTHIHTPV* YTHI I Y 
I lYLFKVLVFGFLRRSLT/SVAQAGVPW 
RDHSSMQPRPPYKQIiTCH* *IiASQSARI 
TGGSHHTWPALILKLERTEAPVISFRTE 
TSK 


1495 


15396 


A 


1503 


406 


172 


DIILDR*RQKQRLRLRQKQRETETE/RR 
DRGRERQRYRQRQVQRQGRRQRQRLRHR 
Q* QRQIDRLRQRQ/RAERDRGRGRDRDR 
GRDRGRDRERQRIiRQRQR*RERQRHRQR 
/RERDRQRQRQRDRGRRRDRETDRGLDR 
G*DRGIDRGRCSDRGGDRDRGLDIGSDR 
DR 


1496 


15397 


A 


1504 


3 


420 


ITGVSHRAWLPS * FLKPFWVEMESHSV
AQAGLK/PSGLKRSSHLDLPKCWDYRHE 
P\PHIAFFQFF 


1497 


15398 


A 


1505 


407 


1 


PISIYHSKDPGI5LKSYTKSTLPVF*KCN 
NKTWLTAYLFTAWFPKPFKPTVETyC/S 
RKKSPFKMLLLIANAPSHPRALMEMHKE 
I PWFMSAVTS ILQPMDQGWLTPKS YY 
IiRNTFCKAIAAIDNDS SGGSGQ I Q 


1498 


15399 


A 


1506 


1 


408 


PPGFKPFCLiSLPSSWNYRHPPP/RPG*F 
FVFLGGTRFHHVGYTASQHLTSRETHAY 
ALQ 


1499 


15400 


A 


1507 


2 


416 


EPRSHHCTPAWQLSKTLSQKKKKKKRKR 
MVFTGRKKGLFPGN/LKUJLGTLSREYPL 
GPGIKNRIiAQKRKPIjFY!CKRF*]SrrNPGG 
GAHPGGPKSWERGGGRKI*TPVGKYASN 
PEYHICIPPGKRNQIirPFLQKKKKRRAD 


1500 


15401 


A 


1508 


276 


14 


SPPPYFLLIR*GRKGRGQF**LRIiFFIl 
WLRQCL/DSVSQAGFQWHNHSSLQPRTP 
GPK* PSFLSLPSSWDYSHAPQLLAPYGA 
NCFN 


1501 


15402 


A 


1509 


1 


391 


NIFKEIMSENFPSLMSEWFPSLGKEIES 
QIQEAQRTPNKMNPKRSTPKQVI IKF*V 
RE/MLKTAREK* I VI CKGTTLRPAVDFT 
ASTLQ I RRE * DDTFKTLKGKKKKKKTL P 
AKLTFP S I LVFKKTRGGSR 


1502 


15403 


A 


1510 


2 


419 


prvrsratnviskykqhkkkntsk*ikd 
lnvkpepikll\ekttgeklldielgnd 
flds/tpktrapkanltp*nyfkliksfl 
taketfnXktptkrganhisdrgliski 
ykeltiq*qknnlisk*akdlkrhfske 

DV 


1503 


15404 


A 


1511 


392 


122 


svslcvslqssgmfrc/lptmplrr*rq 
enpltsagggcsemrshpctpawvtqqd 
svskk*ink*ik*/q*lrqnvkikpspi 
fceqgkrrk 


1504 


15405 


A 


1512 


2 


281 


GGCSELRS CHCTPAWTT \ SETLS QKKKK 

rkkiciecntlksh/ilsfvcggkyhsp 
vs ctcl* wslgrslkrslerhqknlndp 
ypifqnetnih 


1505 


15406 


A 


1513 


242 


382 


QGPNLG*LHPPPGGLKGFSPLTLLRSWK 
NRLPPQHPFYFCFFRKNKV 


1506 


15407 


A 


1514 


1 


388 


rtrgterdt/ rfrerrrerdrgregyrw 
ryrererhreres * v* rdrvrdgdrvid 
rererqsererlr/rerdwerdtyivre 
teteretegerdibrerdivrdrym/rd 

RDRDIIiRE*ERDRETAIiDSEK 


1507 


15408 


A 


1515 


4 


285 


TRXQICNGDKTALY* KEMPSRMFPARVE 
LMPGFKASKDTLTLLLRPNAVGDVlSrLKP 
MMIYYSENPRALKNYDKTQLCLYSTNGN 
KAWMTDYRFT 


1508 


15409 


A 


1516 


415 


2 


MGDRESLPPSRVFLFFLAPCPKGILPHP 
V*FGSVRSFSIERCYR*GQRNEFWGPVI 
RVKASSSRAG*VVFSSTAPPLFF/CFFE 
TESCSVAQAGGQ*CDFSSIiRPLPPKFKG 
FLCPSLPDAW 


1509 


15410 


A 


1517 


1 


324 


PTRPEMGFRHAAPAGLELLSSSDLPTSA 
SQ SGGI TGVSHRTWP I LASNNY/ SMDKL 
CAI/CERFILFFLKNILLWILILFIFIY 
FCIi3:K*CITINFSQNSLLPHSIHLYI


1510 


15411 


A 


1518 


2 


163 


ACRYP WLNF VLLVEMGL \ TMLASDLERS 
G*PQ/CDLPASASQSAGITAESHHGWP 


1511 


15412 


A 


1519 


425 


52 


VLRGAQFIiFQEARliKWDFGPGGMAPPCM 
PPPLGV* KGPLPKG\GGPNPPGPPRGlSrP 
VFSiCKPKITPPGGGAPEVPPPWGGGAEK 
FFYPGGPRVQGTE/RKTPPPPPGRQKGT 
PLPQKQKKKKPWSL 


1512 


15413 


A 


1520 


3 


404 


THASGKSNIRGIiTLPVFKTYYKAAVI R/ 
TVWP/WIiRANTNRQNRL/DGPEVDPYKC 
SQMI FDKGAKAIQ* RTDNNLLNKW* /ES 
* TSTCQKKKKKNQFR/DPAFIil YTKFNS 
NGRKNLRVI GKT * KVLKKNI KQNWGDLG 


1513 


15414 


A 


1521 


332 


39 


TSRETGNFQLDLTAFIiNEVGVLPLWRKG 
KGKI D I G IGTKEMP * FP I * LLMFSY I VI 
ERERKPE/LSLSPSIjECSGMILAHKKNS 
LPGSSDSHDPASRVAG 


1514 


15415 


A 


1522 


482 


0 


RARTSGVLIiCS PGWS *TPDLR * SLCLS F 
PKRWDYRT*ATVPGLLYSLYSRPSP/DE 

LKGCEKSRTSPA 


1515 


15416 


A 


1523 


1 


417 


NKC* *RCESKGMPVYYWYECK*GQPFWK 
TKERFFKKNL/NIELPYNTAIALLNMRP 
Q* I * S Q *RKVCS CMLCAT \ PTPNK* I KT 
MW*VCLVEYSSPLKKN\LLFSTTWINLE 
DISSNK/LRHRKSS ISRFHLLL 


1516 


15417 


A 


1524 


1 


397 


RDSTYQGHHTPPVQ*GLRYGVILFITSE 
VFFFAGFIi/WSAFTRSSLAPTPQLGGHW 
PRTGITRLNRLEVPLLNTFVLIiASGVS I 
T*AHHSIiIESNRNRriQALLITILLGLY 
FTLLQASEYFESPCTISDGIY 


1517 


15418 


A 


1525 


2 


376 


LKAKTGQKLGLLHQTVSKFVNAKEKFWK 
ELL\KSATPVDI*iyiIRKRTSLITDMETV 
* W* KEDQTSHP I P LS * SLTQS KALNLF 
KAMKTDRGKGAVE * KSEANRGWFMQFKE 
RSC/RFCNIKVQG 


1518 


15419 


A 


1526 


3 


386 


ESMLKAKTGQKLGLLHQPVSKFVNAKEK 
FWKELL\KSATPVDI*MIRKRTSLITDM 
ETV* W* KSDQTSHHI PLS *SLTQSKAL 
NLFKAMKTDRGKGAVE*KSEANRGWFMQ 
FKERSC/RFCNIKVQG 


1519 


15420 


A 


1527 


127 


388 


KRKSQINNLLLQFKEIiEK/QEETKPKAS 
RRNKKKKIRVDLLKI KKGKPLEGVKKKG 
GFF*RTNKRDKPLLKLPKKKGGGRIKTF 

HBcrs 


1520 


15421 


A 


1528 


3 


402 


HBNHMKICSTSYVIRELQIKTTMK*YYT 
PVRTTAIQNTDHTKCWQG*REQ/GSLIY 
CW*QCKMVQ/PLWKRVWQFPTKLKHSliN 
I *SAVVIjLGIYP\KSGKFNVHTKTCT*M 
FLAK* PQCSSVDEERKKM/WYMYGFE 


1521 


15422 


A 


1529 


2 


365 


IEKLYRSGAKFFCRDRVSLCCPGWSRTI 
GLKQSSCFGFPKCWDYR/R*AAIiSGRI,L 
liTISSH/REQIEQEFTHYPEKSAKLFMQ 
DGPPRRK/HPPTRPQASNIQNKFS FFLR 
QSL/NSIiAQAG 


1522 


15423 


A 


1530 


1 


418 


GTDTS * ACDKIQKPFPDKRLNKLG/ IRK 
KHLQLDKYL/WKNPIAS IILKSRRLKS V 
RLRPGDGRQGCSFPAWLFNI ILEGLARA 
IR*EKQICQKQGQ\QIGKKEVNLSLFIED 
IMIiYIENLKESTKKPI I IINEFSRKEDF
RP 


1523 


15424 


A 


1531 


381 


3 


ILQGFVIRKKSKIFQKACHTSDLWTLTN 
LSCLHMKGLPPHRIjYSSCTVLIiFLRKPV 
MRKTTLSCCFLT*DFFLPLIjPPCFFR* F 
ILLIF*LyFCKDKVSLCCPSWSKLLP\Q 
QSSCIiSLPKYQDYKA 


1524 


15425 


A 


1532 


38 


479 


dbacgpqdp yltpyvktqwi kd * trnkg 
iqpleengknXfdigfssdlldmtpktr 
atkvkl\ndirlrnfcaskdtinlsllc 
r\kr* pvewekisanhisdkglisgiyr 
qpplnsktshli * k*ardfnrhfseedi 
qsalyrwvl 


1525 


15426 


A 


1533 


105 


447 


LIFCRVFEYLHSLHLPQEXCLSIiALFSR 
PTFCVI IGEVDVWSVIFKVPFCSKRNKV 
AVHTMIiYIQIFVSIiPI*PQNWKQPKCPA 
TVERIWKMWYIHIV/EYYSANKR 


1526 


15427 


A 


1534 


76 


471 


VWVCLLSLEGSQSKPGNS I EFGVLLS SG 
GFSAWRLPPFVYFLRQSL/NSVAQAGVQ 
QWRDLSSLQPLPPGPK*VLKQRGVCLFV 
CFETESHSIAQAGTQWCDLGSLQPLSPE 
FKRFSCL\SLSPPWPSG 


1527 


15428 


A 


1535 


45 


338 


SNNE P FLDWI VM * RKVDF IRQPAMTS S V 
VGPRRRS KALPKAKLAPKKVMVTI WWS S 
ARLIHCSFLNASETIASEECTQQIDDMH 
* KLQRLQAALINRK 


1528 


15429 


A 


1536 


425 


1 


FFNXTFHSVSFSPLSQKCILTL* XLLKS 
IVKNNMRNFQRSLVRK*AKDMNRHFTDD 
HVQMAS / KHMKRCS LVIGEMQ I KN\TVS 
YHYP P IRMIKVRNS SNTKCW * GCGQTGS 
LMRCWW/NK*]SrQLIiWKTGI PPHGLVSTR 
SRVP 


1529 


15430 


A 


1537 


4 


443 


ETFVDHYQCGGIRPFSDLQLHAGRTTAL, 
FKAVRQGHLSLQRLLLSF\VCLCPAPRG 
GAYRGRQVSLSCGGLHPVRASWLLCLPK 
*AWTiyiEGTSTPASIiPPCSLISDCCASNQ 
RDSVG I GPSEPGAE YNIiLVPRFLiSPSEK 
RSIWVGV 


1530 


15431 


A 


1538 


487 


3 


TQNGGVLLSAPRSVFSPTTLR/CTLQAQ 
C*AFWWGGTQQAASSTAA]yiAAiyiKPLGIW 
AGGAAGILPKLGFQD/LPLSAEADPAGK 
ELSIGRQRAWREQPDQSAEPPFSQAPRP 
GYPPSPQPLSMRRGPGANPRLARPLRGP 
VRVRLRRAS SERQKRSRGGSGPLG 


1531 


15432 


A 


1539 


394 


489 


I YLFI FETESRSVAHAGMQWRDLDSLQP 
SPTG 


1532 


15433 


A 


1540 


475 


202 


PGGGWFSPPDNFSLKENSQGGPPVSHPP 
PPPGNGEGGKTPGAGHSGI *NPPPPLKI 
N/r.EKG6RGGPPGPLrjNPPFPKEK:PRGG 
KKKKKKKR 


1533 


15434 


A 


1541 


14 


468 


LSMWWNSRIiKARRIiVIiVSQWNAXKERF 
LKEIKSVIPMNTLMVRERNSLIVAVEKV 
I,VA*VKEQTSHNIPLCQSLVQSKALTLL 
HSVKAKRG*EAAEEKLEAGKHWFLRFKG 
KSRLHNIKVQGEATSTDGELQQVTYPDL 
VKIVGEGDYPKQ 


1534 


15435 


A 


1542 


479 


141 


RKTDSWDLIK\SFSTAK*AINEAD6QTT 
EWEKTFANYASDKGLISRKELKQINKKK 
ANSPIKK*AKDTKKQPTNMKKCRTSLII
REMQ I KTTMRYHIjTTESE * LLI KSQGRA 
L 


1535 


15436 


A 


1543 


17 


474 


NPKKKKKKKIDLLDTYYTLYPK/AEYTF 
SSEIHK]SIWLYIN*LRANLNKFIN*KIKT 
ILSDHNRRQPEINKCaJKTKR/RVT/TQK 
FK3S7LILSNYWVKGELQTQIAEFLKRNVN 
K]SITiyLNT*NTIKVIjNSGGST/AIiNTYI 
DKNFKKRVPS 


1536 


15437 


A 


1544 


89 


481 


ICLKVISIjRQENATSVACIVSCLSEGSR 
ASENLVLHNASPNSCGLGIAFTCLCNW 
SSGNKITDCYD*LSSS/WLGFFFFPFLK 
KSFIiFVAQAG6QGGNLG*PKPLPPGLKQ 
FL.PPPRANFCFLEKT6FFLF 


1537 


15438 


A 


1545 


1 


300 


PPPPAPXXCRPY*XPWXXPVyHS SWRHG 
S GAAQGAVLAGFGGVGRQGPGAAS VS I P 
LCPENQ6CREPGPSHAVPAPSALPSLRS 
LTGGQIGGTRAAQAVG 


1538 


15439 


A 


1546 


2 


436 


GAPE IKS IIKGYWD/RLCTTKFYISTLDEM 
DKFLVRHKLPKLI*E*IDNLNRWITSQE 
TDW*I*QQSSSSSSSSSS/PSSSSSRPN 
GFTTESYQSFEDKLIPIICKLLKKIDKE 
\GHFPLQIi*GITQIPKPDIYH/lENYRP 
ISLM 


1539 


15440 


A 


1547 


54 


419 


PATWEDHLSQGGRDCSEPRLHHCTPAWV 
RE*DPTSKKKKEK/NDQ*LLIP*S*RRL 
*RSSHQTSHLMDWNSEAGR*EATFLRSY 
S KL \ KEELELLL I liDNNDDHNNSD* YLL 
SIYSIPGTVLS 


1540 


15441 


A 


1548 


37 


339 


KRWKCLRA* LLMRPRHT^D * LNPDGVGYS 
EL*SRHCTPAWVTEQDLVSISTNRKNER 
HTLEYSHQHYSG* P * TG/EESEYPSAL * 
QREIIDYSFIQGMTDQL 


1541 


15442 


A 


1549 


477 


1 


PGSHDLGSYT*PQ/WSSPEVTSRDAPS 
HPSAPKFCSNPCRGRWLTSSKQPKLRLT 
SAVPGAPGARGTFHALGAGAAEAGGHSA 
SRPEAALGRPLP PLPMTLTSHPLLSGPG 
RLAWGCNGRR* IKGGG 


1542 


15443 


A 


1550 


430 


8 


CWPGS SGTPDLK* STRPGLPTCWDYRHE 
PLCPASKTFLSPQIETPYPLNNNAHSPR 
RPALWYSLLSVSMDIiPIL\AFHTNGIT 
GYVASPAPPVRWGSHSVAQAGVPWPNLG 
SL*PPPPRIiK*SSRLSLLSSWDYSHiyiTP 
LN 


1543 


15444 


A 


1551 


2 


419 


ETSPSIiQGWLGVLPPKRGAKTSRFLI IR 
PQGGSFKDGDPPNPPGEIKTPPAKKKKK 
KDSARS PPARLQA* GAGLWDARAS FRRP 
FQAPVPLSAQPRAQPLAVGTSRDGGSSG 
PTPGQESAV\PWREKHPQQPPPQPG 


1544 


15445 


A 


1552 


2 


387 


FRHVAQAGLELLG\SSDPSVSASQSTGI 
RGMTYRAQPGL* LFNSKNS ILSGPKV/L 
QDYMW 


1545 


15446 


A 


1553 


393 


2 


KNPIFFFLKQGFPFFP*IiEGRGKFIiGPC 
HLCFPGSKKG\LPHPPQLIGAPGPPPSP 
GFPFIiKRGFS/LFCPGGSFSLRRKGPPP 
PALPKF\GFLRVTPLAGPGNPSFFFFFF 
RDGVSLCHPGWSAVAASRLTA 


1546 


15447 


A 


1554 


1 


427 


LLLLILYAIVEAVT^TEGAGYPPLPGSY 
SHRGASGHRAIFALHLTGGCCI * GAMHC
ITTMIDIEPPAITQYHTPLIiA*SGriIT* 
GL\LVLSLPVLDAGITILLTDR2TLXTTL 
FNPDGGGDP ILYQHIiL* VFGHPE VYIRI 
LPR 


1547 


15448 


A 


1555 


3 


382 


TLYLriFAA*AGGL/GSALSLLIQAELGQ 
PGISTLLGNDHIYNVIGTAHALVI ILFIVT 
PIIIGGFGN*LDTIiIFAAPDMAIjSRIjNN 
ISL*LIjHTSVLLLIiASAIVEAVA*TG*T 
GYIjCLAGNYSHP*ASV 


1548 


15449 


A 


1556 


380 


3 


EVSPCCPGWS*TPGFKQFACLGLPRCWS 
YSREPAPPAKACFLDI*VK*WFCK*GSP 
EVNSRPGLYLFIYLIKQKKGVK* *PARY 
LVIiFFCFFVFVFVFVFF*HRlS\SVAQG 
GVQWCDLGSLQPPP 


1549 


15450 


A 


1557 


100 


254 


I PTVPT Y* TP IKSFHRARIESSGPGYS W 
PVDSAK*VPLAVVSLDSR\RDSGNLVHP 
LMRVTO* MKRHL VTLTQS \ C YS 


1550 


15451 


A 


1558 


2 


289 


APGVSSTMEDEMGGGLEPQRRGCSKQRS 
HHCI PAWATE ^DCLKFNNNKKNYLIiL I S 
LRTQS YLYF/ C * VWS Y*LKLS I KLAGGT 

GEKEH*SQKRKSK 


1551 


15452 


A 


1559 


24 


354 


PLPSAS PGPEGATPVPTS /ACPNKIKIiY 
HLKKKKKKKKKKKKKKKKKKKKKKBCKKK 
KTKKKNKKRKQQLNE * ETG 


1552 


15453 


A 


1560 


376 


2 


aargsgvrdpleeavclfsdlqlragrt 
tallkalfk\rqghlslqrlllsf\vcl 
cpaprggayrrrqaslscgglhpvrasr 
lliclpkqawamagapppaslppcswisd 
ccasnq * ds vgvg 


1553 


15454 


A 


1561 


3 


408 


AASTVFLPFLERKGIDFGFLFFFFGEKK 
FPFLAPGGAPGGLFSFPEASSPGLNPFF 
WPNPPEK*KKGGPPPPPGPFFFF*KKRG 
FPGG\PGGAPFPDPKXGPPGPPKGGEFR 
GGPPPPGPNFFFFFLKGKGGGGPP 


1554 


15455 


A 


1562 


355 


161 


FKPGDGGCSEPRWCHCTPIWVIMRDSVS 
KPKQSKTKKRNVFT/C*D*VF*lTNNfINY 
INCFS * RAL YIL YEKQFM 


1555 


15456 


A 


1563 


410 


1 


TPPPPPKNFLGTPIiFPKNKAGKGLFPPP 
LGFPKGGQWPIPPQRFPPLFSPKKRAD* 
KPKPPPAI/KIRGPNPGVFPKGFCGFSFF 

PPPPFPRGGGLIFFLPP/TK*SGG*KKP 
KKKKKEKCPKKQRGGIDQLTSNLGV 


1556 


15457 


A 


1564 


2 


374 


ADRNLNTTFFDPAG / GVGDP I LYQHLF * 
FFGHPEGYILILPQFGIISHIVTYYSGK 
KEPFGyiGMV*AMISrGFLGFIV*AEDHI 
FTVGIDVDTRAYFTSATII lAIPTGVKV 
FS*IiATLHGSNMK 


1557 


15458 


A 


1565 


396 


0 


1 1 FLIFLRHGFAWAQAGMQWCGLGSLQ 
PLFPGFRLFS/CLLSSWDYR*RQGPTML 
ARI.VLNS*PQAI\SASPSQSPQITDVSH 
CA 


1558 


15459 


A 


1566 


186 


452 


KQKCNFKTIiNTKYQPLPFFFLERNFCFC 

PPGGGEGADFTFLEPLPSGAKGFF\ CLT 
LQRM/WE*RVSPPTPLNFGFLVKKGFSL 
CGSTGF 


1559 


15460 


A 


1567 


453 


37 


KTALYWKKTPSSTS lAREEKSVPGFKGQ 
AHFLFRGKHKFKL VS KL I YHFENP /R/ A 
LKNYAKS I L / P YKWNKAWMTAHLFS P \W
PIKYFRPTVEM*CSYNKILVLIDTGLt>H 

PRPLIETYK\FINLHVCVLFNTTPIVQP 
LPR 


1560 


15461 


A 


1568 


453 


39 


TALYWKKTPSRTS I AREEKS VPGFKGQA 
HFLFRGKHKFKLVSKLIDHFENP/R/AL 
KNYAKS /TLAYKWNKAWMTAHLFSP\WF 
I KYFRPTVEM* CS YNKILVLIDTGLDHP 
RTLIETYKXfINLHVGVLFNTTPIIiQPM 
PRAQ 


1561 


15462 


A 


1569 


1 


352 


FGTRRERERERERERERERERERGRGGA 
GHQIPSIEGQHKIVGAIIY\CREC\GEK 
ISGSTSYIKVWDIRDSAKCIRTLTSSGQ 
GISGDACAATFTRAITSAQGEH* inqia 
LRPSGT 


1562 


15463 


A 


1570 


394 


3 


TLDWPQSRGGSTGKPVYPSVCCCCCCCC 
CFNFCQEAEKNTSTAEGLrjHNQAGRTKDGS 
FAPSHDH*A/PRGTEV/DLLESTIiQTS I 
KQVESKPR\EQARTGAGGQKEKATQNPE 
KSVLTSMYTKSQGSEGRLPGNR 


1563 


15464 


A 


1571 


399 


2 


KHQIiPVFWQYNKKAWTTRTLFLDWLHCC 
FVFEVRKYIiASKGLPFKWIilLDNAPGH 
PPRTP*VQY\KGIEVIYLPPNTMSLI*F 
LDQGVIRTYR/SHYTQYSMQRTISAMQE 
NSNKENI I KVWKDS TTDDAI VA 


1564 


15465 


A 


1572 


37 


400 


RGT I TGEAASADQB lADKLSDAINKIME 
EKa\Y*LPEQVFNLDESTIiFWG\KKKPQ 
RTLLSKEKKRAPGFKTGKDRLTL,L,FC/A 
NAVKLI IRTALTYKAENPQALKENVKHQ 
LPVFCLTTRGL 


1565 


15466 


A 


1573 


84 


485 


AGHKDS PRPHQTQE PS WLHLWDPAPGLQ 
VSLPAS P \GRALiALLSPWVVDGTGRPGA 
GGGTRRGGS GP TGAHGAGGRLMHGGLQ V 
PSPAPREGS*GPARYQAQRMWARTAGGP 
STPSAGASRVPSPHCPGP 


1566 


15467 


A 


1574 


3 


463 


TPAQGLRDPSNMRKHAYCGCCV/ CITLC 
VGAQNiCNRAVCGIiYSTCPRLCVYEHEHI 
CVNE*VCEHVCERESVRVCESTH/LPLC 
A* TCGP I FGCMSEKHVPSYTPCVHRVCV 
CVCVHLGCCVC/VCVCVCVCVCVCVCV 


1567 


15468 


A 


1575 


1 


383 


FLS FGFAPQAGGQGHNHG*GP P * P / PKA 
KGIFPPHPPEKREQRVHATPPGKFLDFF 
FFWKKGGIiNIiGPKKNLGPGGKKNLLVS P 
PKGGGKKKETPGPGGVFFWGGIFFFPPP 
PCSPG*PLSLLKKPKGG 


1568 


15469 


A 


1576 


35 


469 


RIPRCHQPVGPLGCREGAKPQGPGPDAA 
DSHPPAS PRAH* / P * SIiHRDP I PGFRGP 
CRRNAGAGPRAHTAGICAPQPNS * SRHH 
PWAAAPAGDDSPTTSLGSHVPVGRKSSD 
GCRQRALCTPGS PAPSEAEVGGS PELRS 
LRPAWA 


1569 


15470 


A 


1577 


473 


62 


SLEINPYISGQLI FNQHAKTNH* SMRKE 
\ S FFNKWCLDN* I STGRRMKLDP YLKPH 
IKLNSK/LKDIiNIRDTTMINIiHDLGFGN 
GF*TMTTK*AIKEKIN/WDFIKI*1TFCA 
SNDI I KKVKRPNVTAYTCNPSTLEGPR 


1570 


15471 


A 


1578 


59 


426 


LERRS FGAPLS PFFAPQFEMKKGFGS P * 
KFFFSPKALNFGGGWPFFPPPKKRPFS 
KNPQEGFIPPPLKKKKKTSQPP/YKPGP 
PKES FKRAPP FFFFKGGPFPPIjVSNQWT 
GKPRLPPPPPKETQNPSSFQIGGQKKGK 
GGPQSFSFPIKKKKKKGGHSRSRTSPRV 


1571 


15472 


A 


1579 


2 


419 


KANKTKNAYFEGINKMDKPQVIIiKKKy/ 
REKT* IPS IGEEKGI MS P \NSEDTKRI I 
KECFKQTYAHRFYSIiKE/MDILIjESHKIi 
PKLTQ/EETDSLNSPV 


1572 


15473 


A 


1580 


129 


403 


YMFFIPINCQDHPK*KKKKKKKKKKRGG 
/RPFKKTLRGPKLNHAEKNKFFF*KGS I 
KKKCLE ILKKKLFFGGEKNCKNPPKKKK 
PSREKKKF 


1573 


15474 


A 


1581 


317 


76 


PRFFFPFPPPKKGFFSPPFPFFSPRFFP 
PPPFXiKPPPRPFFFGPFKKNFF\PPPRP 
LIFPFF*APPPFPFPFFFPPFFFV 


1574 


15475 


A 


1582 


259 


377 


PREMKT YLiPTKKLGYE I SH YH * WWGCKM 
VQPVWKTVWQFLKGLNIKLP*DSAVPLV 
GM* PREMKTYLPTKKLGYE /MFTLSLLI 

TANK* KQPKCPRMMKW 


1575 


15476 


A 


1583 


1 


415 


PTRPITSSICLRQSYLKALIAYSSISHI 
AL G V \ T A I LNP T P * S FTGAG I L 1 1 A \ HG 
LTCSLLFCLANSNYERTRRRI I ILSQGH 
QTLLPLIAF* *LIiARIiANLALPPTINLL 
GELSELRTTFS*SNITLLLTGLNILGT 


1576 


15477 


A 


1584 


216 


406 


LNLLPLVLGGSSCLPPPCGGN*KPPPPP 
G\LFLEKKGFSPCGPAGF*PPALRGPPP 
PPLPRGLI 


1577 


15478 


A 


1585 


383 


3 


KKFGYPFYWIG*KIIjK*FPG*K*SLPHR 
KSPFFFFF*GRVLLCPPGWRGTTKGHGS 
QVT\liPAAAMTFQV\K*SSPLRi:iPSR*G 
YRQASPWSGNFFF/CLVESLSMLPGLIL 
NYWAIAIKPSGPPKVLG 


1578 


15479 


A 


1586 


126 


413 


NPTLKK*KMKENRMKKNEQSLRDL*DTI 
KLTNRCrLGIPKEERKKGAEGIFGEIMV 
GNSSNLIR\ENINLiNIEVAQ*TLSRIN* 
KRATLRHVIMKM 


1579 


15480 


A 


1587 


242 


409 


GWMIFRFNFFFFIiRGSFTLVTQAGG*GG 
DFG/SLRPPPPGLKRFSCLTLPRSWDYR 
H 


1580 


15481 


A 


1588 


2 


338 


EIEKKGKGKKRRG*RSNKKKRGGRLEGQ 
KY*SPPCPRLSFFLVDEKGPEVLQDSL6 
GWWKTP/SGCPEMTDSSQPY/YRAFYVIi 
KN/QRVGFSVDVGE lEKDQDVEKNQDPS 
CPRL 


1581 


15482 


A 


1589 


360 


0 


NNFLEQLKFPPKKKITTDSYKAPRPFFF 
FPPSQKGVFPPTLFFGFPPGFPPPPFLN 
PPPGFF/CFWAPLK:KFFFPYPGG*rWVS 
LKGPP/Ij/RFFFFFF*DGVSLCHPGWSS 
SAQS 


1582 


15483 


A 


1590 


75 


412 


VEGQHCNFCAAQET inrvkrqhtellet 
fanwspdkgliprtykelkhlnrkkhsy 

* KWADDLDRHFSKETYTPPKTYS ctqyc 
*P/SITIEN*KBKTSNIKTRi:iQGGHKRL 

er 


1583 


15484 


A 


1591 


309 


1 


FSTQGAHMQVCYMDTFHDAE I WTS I EP V 
TQIV]SnyiIiPNR*FFNPGPHFFPPPFCSP/ 

lvsfllllrdrvslccpgqsrsvgfk*s 
tclglpkrwdykcepphva 


1584 


15485 


A 


1592 


2 


415 


LSISLSIFSFLP\FSFLLPSLSliSLiFLS 
FFLFLS F FP PS FF F FFHRAS LCHLGWSA 
MA* S WLTAVS TSQ \ VKQS SHIj 


1585 


15486 


A 


1593 


3 


396 


HSAFLFFL*DRVSLGHLSWKAVAQSQLT 
ATSTLL/VLKQSSHIjNLP 


1586 


15487 


A 


1594 


300 


62 


KMTGVLKSSCGKS PKQVGWLCVCVCVF / 
SFKRQGIiAS/VAQTGMQWGDHS SLQP PA 
PGLjR*SSCISLISSWDYRRVPRRAPG 


1587 


15488 


A 


1595 


1 


412 


FDRPAADQKAASALKASGVQAQMAKGTY 
HDWSLQDWKVLWMTH* VS \QQEQDPTNIi 
YISNLPLCMDELQLBNMLKPFGQVISTR 
I LRD YSGTFRGDGFARME S TDKCDAV I B 
HYNVLIiIMTPPGVSAPTEPIiliCKFAB 


1588 


15489 


A 


1596 


428 


2 


QSESREPENFLLPTSTSSLLLVHCLSRT 
CWNSTLSCVPHPFHLTFNSHIiLLTSQP 
RRVHPC* IjSLSS 1 1 SWKLLNTLPGVP IR 
\ASEIFGLRTIRNFPFLSHSVLFPFSLS 
ESHSVTQQGVQWHELGSLQPLPHGCQWL 
SCL 


1589 


15490 


A 


1597 


2 


442 


QGSL*PHPPMLK*SriTSAF\NYREiVSPH 
loANYFLFFF * KQGIiAVLLGLALN* AQ / C 
HLP ALAS QSAG I TGVSHGAWLFFFFLWQ 
CYriI*FLILIF\ERRNSPVAHVLVNGGD 
LGLLKPPPPGLMGFSCLNLLRSGKYKPP 
ATSPGYIFC 


1590 


15491 


A 


1598 


419 


121 


NDGYPRVSPPPPFLNPPPEFYFGPPKKK 
FY/ PPPPPAQKIDPP*TPPPFFFFGTDG 
SHYVAQAGLELIiAS SDLPAS ALQSTG I T 
KHEPPHLAETI FLVFL 


1591 


15492 


A 


1599 


45 


397 


DRVSLC YPGWS AW \ * S *LTAASNSW \ V 
K*SAGFSLLSRGDYMCMTLHIANIKKNF 
liGRVRGSHL * S YHLKLCLKK*RTKEGFL 
S FFWKGKRHFLGQKKTIIiNPRLKVFFLPG 
PPKGLGY 


1592 


15493 


A 


1600 


3 


397 


SRRPGRFSLMLTLS WHS * VCIiAALiAARE 
EQWSGCFKSHF*LEVNFWVSGIDREV/C 
GY/LKTVLGEDLNDYVSTQ I *D * LMKPR 
CPEKQDESLLKEFGGGA*RLISrVVHRPGA 
VAHACNPNALGAKGRRIPRSG 


1593 


15494 


A 


1601 


244 


2 


KTKTSFHSLIDSCGYLLSYSNFQI*KKM 
I FK/ YLiHIiRAVPRHVIVRFTNLERQEKV 
LRAAREKG*VMHKGKPIRLTADLSA 


1594 


15495 


A 


1602 


435 


2 


PQEAIYTPTQHPTYGAICRIARIHGSRD 
Q/S VEMKWHPQLTITPSGP /LGKSLLP I 
PVALCFADLEVFI PKGGMLS PGVTI ISL 
ITWKriRLPWVILQ*RITPLaRVIPPDYQR 
EIRLLLiaWERKKSYVWNTRDPLGHLVVIi 
PCPMVIK 


1595 


15496 


A 


1603 


288 


8 


EPPriIREMQIKPTMRYRIjTPV*MD\R*W 
*GCGKRGTL.VHCWW* CKLVQSVCERSSK 
NITIELPCDPAVLILGMHTKERKIjRPCG 
WTRGVGPGxC 


1596 


15497 


A 


1604 


411 


236 


l,IPSEHFFFSG*F*AFDRSSriAPTPQLR 

rhrpptgllplsllqgpXlltpsvlls* 
ggrfkgpnftppggqgkifpmgppksnp 

GPGV 


1597 


15498 


A 


1605 


2 


437 


kclrispcagprrpwcpsfeprvcvwpl 

GVPQ VG / PEGP * GEG \GFEGGDVRQLWQ 
GKKKKKKNTLGKRGPWFPKEKRGFSLTP 
WFGPQGLWEPCLNPGGPLFIiIGGFQKGP 

KVLIGKGKTPGPKKKNNRPRKRGSLGPP 
GKDDFPL 


1598 


15499 


A 


1606 


31 


467 


EFPGRRFRGGGCSELRLRVCTPAWTTEG 
DSISNNI *LWEN/HTVNSWKCSDS 


1599 


15500 


A 


1607 


418 


32 


VGQVGLEIirjTSSDPPASAYPK*WDYKR\ 
DPRAQPNPNILKAQSNRSPPRQR*PASS 
FSNQNS CMHRCI SLP * VLFL I FFKNRP * 
NlSrSFGYNSSGTOSGLNILlsrSC* FWFLFG 
NSDTEIRICRSFHRGREV 


1600 


15501 


A 


1608 


379 


3 


FPILSPSYNPSSTEIiP***SLQSPLFEP 
YDFFSVNMFSFLFFNLKSHTVAQAGVQW 
RNHGSLQPRSFQLK\HPPASASAGTTGM 
CEiHAW/LIYLCVYLFLEMRSHCVSQD*V 
QWHNHSSLQPETPTK 


1601 


15502 


A 


1609 


456 


99 


PFPTPFPPPHSNWGLGLGCGL/EPTMTP 
SGLGLICGSPS*SPPAHRLSGAQLCPVLR 
APVLGPSQMPPGRKKKPTARGSPWRKGV 
FVMSGQSDPLGPSSCQELGPRQSTQGTF 
PGA 


1602 


15503 


A 


1610 


1 


420 


FRFSDGAAGQKCSS PPRPGRGRAEVLLT 
SQTGRQGRGAPHISDNGQPGRDAP\PS * 
M*WRPGRGAP\PS*VGWRPGGDAPHF\Q 
TGQPGRGAPHIPDDGRPGRDAPHFPDGV 
GAGQRLQTRHFGRPTA?iAWKVKYVTSriR 


1603 


15504 


A 


1611 


426 


3 


KNPFLLEAKVSFNPKWPPAIiPPGEQRDS 
VSQKKKKNLLIHKKAHSKH * FF I CRE \ C 
ESALLLHQNIHAGGKSYVCNK^GRGFRN 
KSHFTYQRTHSGKKAFL*KECG*DFL* K 
AILTAYQKTHSGKKSFVCKECR*DFTQK 
TK 


1604 


15505 


A 


1612 


428 


273 


HHA*LIFKIF/CVETRVSLLCPGWS*TP 
EI,K*SSLLGCPKCWDYRREPPIRPTT 


1605 


15506 


A 


1613 


311 


4 


ANKFKWLNEIKFPEAHNLPKFTQEGGLN 
NP VS L Y * KN* TYS LRLFP \ KKKKS GPDD 
FTGE FN* TS KEEVP S LHKLLQKI * ERNT 
LPNLFHKARVTQVS KS EM 


1606 


15507 


A 


1614 


338 


88 


PNPPPPSKGK\GFPPPTPGRKKKTRPPP 
PPGKFLGF * KKRGFPPFERGGP \NPAPG 
GPPPPNPPKGGGTKGEPPSPEGGVFFFF 


1607 


15508 


A 


1615 


2 


162 


KHGGTCLYSLLLGRLRHESCLNLGGGGC 
RE/PE / SHCIPAWATE * DSVSEKKKYL 


1608 


15509 


A 


1616 


3 


399 


PSVREYLTSRGLPFKVLLILDNAHGHSE 
PQRFNTEGINVFYLIiPNPRSLIQTLDQG 
VTRTFKSHYTWYSTERIANAMEENPDRT 
S*KSRIMTPL/IDAIVMTEKAMEAIMPK 
TIISCWRKLCPDWHDFTRFT 


1609 


15510 


A 


1617 


390 


1 


KRNCFGPFLTPPPGQRLGGFKFLK\HPF 
YYRGERGGFPFSPFNKGFPPPPPFWGPF 
LGALKF*RGVPPSKPPPPGPGKKPKLKP 
FPPRKSYGGCFKCFLGGPFPFEGPPPQK 
KKKKNFTAARDLEPNAW 


1610 


15511 


A 


1618 


468 


0 


MKLVNIWLLLLVALL*GKKHLGDRIiEKK 
SFEKAPCPGCSHLTLKVEPSSTVAEYEY 
IVAFNGYFTAKARNSFISRALKSSEVHN 
WRIIPRSNPSSDYPT*WP/VALKKKKKK 
AGV/LPLENYS I ITRV 


1611 


15512 


A 


1619 


421 


2 


SSRLSIiPKCWDYRREPPRPAQPRILKKI 



S* *EPFFXjILKCI * IjRTQGLGRGHKGKIi 
NSTYVLHKFMD* *LGQLMNGLNMLDESS 
FCNVLKHTKQXWRDYYFVCIiFVCFVRQS 
L/NPVSQAGVQRRDHGSLRAPPPGFAPF 
S 


1612 


15513 


A 


1620 


406 


2 


GTKKASSNGFINGPGAAKKKSQTFAFLA 
KPGAFPSILLDEKNRPQIKPPACFP/SI 
RESQTSYPVFFFWSPSCGFFFFWGKPEP 
riLLKEAGKVPLFLFIiFGFL*DGVSLCQP 
GWNAVLRSQLTAESNPHASAHAS 


1613 


15514 


A 


1621 


411 


0 


PPPPKKKPSGPPPP/PSSSSSPSPPRKF 
LGGPRVFFPPPFFKPPPPKNFWGPPKKK 
KFPPPPGGKKFFF*RAPPP 


1614 


15515 


A 


1622 


2 


403 


TARCGIiNFPCSSSLPFIAS * VAGTTGTH 
HHAQL ILLI FCGDELSLM\ CPGWS / PNS 
SHLGLPKCWDYR 


1615 


15516 


A 


1623 


298 


411 


LIINVCWPGLVAHACNPSTLGG*GGRI/ 
TMRSGVRDQP 


1616 


15517 


A 


1624 


263 


2 


DS VS KKKKNFKE * L I PVLLKLFQNI EEE 
GILPNSLYKAGV\LiIPKPDKDT*RKEIY 
RPISLINIDAKIVSKILANKIQ^FIKKI 
TDAW 


1617 


15518 


A 


1625 


3 


281 


PFSCLSIiPSCWDYRRPPPRPANFF/VYF 
YKKNTRTQCFTVKHGFTVLTRLVLIS * P 
CDPPSLASQSAGITGVSHRTQPHTVFFL 
NNPALPKLQT 


1618 


15519 


A 


1626 


300 


20 


NPGPRGFPPPGPPKRLiDFRGGAPRPGF* 
YFLKNFLGFFWFQKNFLVFFLGNKTPFP 
PQFFFFF/RDRMSLCHPGWSEVAQAWLK 
AALTSQTPAI 


1619 


15520 


A 


1627 


394 


40 


PQPAAASLFSPGLFFFFPPVFSPPPFFK 
TPPRIFFFWPP*KNFFF\PPPAFFFFFF 
LGAPPFFFFFFFFFFFFFF*DRVSLCRP 
GWSAVAQS*LTAALNSQT 


1620 


15521 


A 


1628 


386 


3 


IFPTRCTHIiHLGTNKLPTCSFFDQAKKN 
PFGS \HHSRGVGLRARLFCERLTIEGAG 
TPACPAP * FPGEPTRP/PGRVRWLiTPAI 
PALWEAETGS * YVARTGLELLVS SNP PL 
SASQSARITGVSHRTWP 


1621 


15522 


A 


1629 


401 


93 


ARGVLPUSrPP PWGGRGGGS P *GKNSKP P 
RPRGENPPPFLKPQKLPPPGGGPPPPLP 
LGG*SKKTPPPPKGGAPINQKPPPGIiPP 
PGKKGAPFPKKKKKKDKNIRTKKKARRS 
GSLLQSQHPGRPR/RGQPPLTKNPPLAS 
PPQEKRGPPFQKKKKKKTKI 


1622 


15523 


A 


1630 


417 


47 


PPPGTISSPNP/QKNIiKKGPGPGGNPRN 
PPPLGGQRG/RGLWAKKSRPPGPPRGNP 
PliFKKKKINGGGGPPPWPPPRGARAGK 
SLyPGGGPPQ*PQMGPPPPPPGAKKGFF 
PKKKKKKPKKPRKT 


1623 


15524 


A 


1631 


416 


54 


EyWCKGR*IDQWNRM*FKMDQHLHSQFI 
FN*ATKAIQWGKESLFNRLCLKNWLTIR 

DK\IYLDACLTTY\*KINSSWHSGAFL* 
SQILERIiRQEASLSPGI*VQPRQHSKTP 
SLKKFFFKFFS 


1624 


15525 


A 


1632 


2 


373 


LVFLDNMLKLLRHSAIiASACFPEDIjAKI 
MDEGGYTKGQ I FKVHGTAF CWKKMPSRT 
FWREQSVPGFK/ATD*IiIiI.GANAAGN\ 
KLKPMPIYHSENPRVFKNYVKPILPALY 
TWNSKA/QMAAPLF 


1625 


15526 


A 


1633 


3 


383 


NI WN* KAWMTVHL / FIT* FTE YFKSTVE 
NC*EKKKILFKI /LLLIDNTFGHPKALM 
EIYKEINVIFMPANTIS IL*PMDQGVIA 
TLNSYYLRNTFHKTIAFINYDSSDRCGQ 
SQIiKNFWKGFS ILDAIR 


1626 


15527 


A 


1634 


2 


182 


GACTQLIGRLRQENRLN/ SGDRGCSEP I 
LHLCTPPWATE*DPVSKKKKKNPCPKLK 
KGPPP 


1627 


15528 


A 


1635 


331 


1 


LFPPPPVLKSGPGPN*N*PPLKGKSFGP 
LKKKFFFFFF\FRDWVLLyCLGWSQTSG 
LKRS S CWDYRCEPPHLTPNF / S YF/ CRD 
S VSL * PRIiEGSGS I lALCSLKIiLDSSS P 
H 


1628 


15529 


A 


1636 


80 


381 


KLSKCHAHHSRGFYKYSLFQLGASQFPQ 
VLRITHPHKGSWAAPRS*G*SQC/SHFF 
SFFFETKSHFWQAGGYGRNFT*LQPPP 
PGLKRFSHLSLPSSWDYS 


1629 


15530 


A 


1637 


381 


38 


SKRQGFHHVGQAGLELLS S S DliP TMALH 
PTCPLQKCWDYRC/DATAPNLSSTFFMQ 
EKCIiAFL\PLFSSAPHSIi* *PKAVIEKD 
CPGL* IW* *VARSQRMNVFNFIK 


1630 


15531 


A 


1638 


3 


295 


PGPDGFTAE FNQT FKE * L I P I LVKLF * K 
IQEVR\FFKFS ITLTPKSNKD 


1631 


15532 


A 


1639 


48 


380 


ILGKAI S FT IS * KGLKYLGI YLTKEAKG 
LHTENYKMLLKELKKDTNIWKGILCQ/W 
TRRLNMVKISV 


1632 


15533 


A 


1640 


343 


23 


SWLTAT/FCFLGSSDSPASAGVELLTSG 
DLPALAS*TAGITGVSHRTRPALSINTS 
TLLCSSPYCPPHLQSLQGTTQPVPFLKA 
QRTGCLLQGVILTSQAENFCNKH 


1633 


15534 


A 


1641 


3 


397 


LELFSSAHCCPSLTVMQYYPPXRPTSHD 
GQREKSPHRTKKKKPL.EGVFLG* KIKTN 
LENPPPLPFFGGGPPPKGGGPFKIV*GG 
WPWPWGLKIPTLPKGPTPRPPWGPLGTF 
GGEGKPLRAFPLQRFFRGPE 


1634 


15535 


A 


1642 


2 


308 


NKWR* GNWI SVKIDNFNFYLMPYTNLS * 
IRDDI/NAKATTIKLVGENIGENLGIGIOSr 
F*ERTIiKA\LRGKKMDKLDFITIGNFCF 
SKDRIKNKNKARRG6SRL 


1635 


15536 


A 


1643 


16 


386 


EKKKLSLFTENGI PYL*NPKESAKRLLS 
LXNDFSKV*GYKNW\DEKSVAFLNTOK 


1636 


15537 


A 


1644 


533 


3 


PLSLSLSSriFLSFFL*DRACFVAQAGVQ 
WLDLGSLQPPPPGIK*FS/CGGNVAVTP 
RLSPLTLPAMTEVRIiPSSKIQTNKEKNT 
VMSEIYQSL.M*MGWG/M*VRMCVKFKKP 
WVDN* * VSVMGCLLPSFLPSFLSFLFLS 
PSPLSFFPYFFETESCFVAEAGAQWLDL 
G^LQPPPPAVP 


1637 


15538 


A 


1645 


341 


19 


GIGGRPP * FQLLGRLKQEK/HLERGKGF 
NEPKSRPCISAWATKGDCFQKKKKrHMT 
LQQIQPKQSWKTDISQFRTDYKTTIIKT 
VGHGI W ID I F INGI ELSENLNQCS 


1638 


15539 


A 


1646 


279 


3 


TFYHNEKDNNKCWLTI \GKI *MLTGCYW 
ECKMVQIiPWKTAWQFL/R/DWI*LPYD 
liAITLLS I YTRKRKTYVYTKTCIQMFLA 
VLFTTAKRWKQP 


1639 


15540 


A 


1647 


392 


1 


QKLEIMLREEGMLKAKTGQKLGIiliCQAV 
THLVNAKEKFLKLNVLLQGTHKW*ESET 
AKSLI LFNSMKAEKGE *AAEEKFAASRG 
WFM/RFKERNCLHHLKVQGEAASAWEA 
VANYIEDLVKKIDKGGCIN 


1640 


15541 


A 


1648 


425 


148 


SQSLI * SKALTLCasrSMKT/ERAEEVAGK 
KLEAS]SrF\LKFKKRSCV\RNVIirMQGKVA 
SLDGEAAASSPEDLVNFIDEGGYTKQQI 
FN-*DKTSFFF 


1641 


15542 


A 


1649 


271 


462 


RRQKKRHKRCILSPIiLFDIVLEVPARTI 
*QEKKIKGIQIGKKEV/KIISLFADDW 
LYL 


1642 


15543 


A 


1650 


70 


398 


RPEASLRHMCIiNAGQLIjSKRAKLGALSL 
SPFFWKKSIiAFAPQPGGQGGNLG* *KPP 
rjPGLRGFSGIjTLLRN/WE*RWPVPPPT\ 
NFGPLIKTGFPLVGQAGFDLRTLGALR 


1643 


15544 


A 


1651 


425 


3 


FEGFPKVGPPLGPRFPSRVPPFGTLPPP 
PKRGPRCFPQPGAPPPKIWTPPGALPQG 
GVGPALPG/ALQKFGPKWPGGPFSGPPQ 
MAP\GGFPGGP*RPPRGGAPFPFFLRQG 
FPVAQARVHIjPGSSDPPISAPQVAGTTD 

vc 


1644 


15545 


A 


1652 


385 


2 


KGNNPSPPSIKFFFFFF*KGL.LPLPQGG 
GQWGYFRSLQPPPSRLKLFSCPNLPSNW 

eyrgp* \rll*litgrgtsgsktk:vptp 
cgpfnlnqlglgtqarnfs / rlvffpfe 
tespfvaqagiqlrdlhs 


1645 


15546 


A 


1653 


242 


3 


knknfg inrgffftfkvpgfffflgkvk 
lffffrnlkfflkakpp*\vffpigpsf 
ffffffldrvllccpgwnvwqsl 


1646 


15547 


A 


1654 


3 


285 


HF 1 1 YT KDLiNRR FSKEDI*III KH.V/ KK 
CSPSLAVREMQIKTTVRPGTVAGTCNPN 
TLGGQGRRIVQDQLKQQSKTSSLQKKIL 
FRLARHGGTCP 


1647 


15548 


A 


1655 


1 


373 


KVSLFFFFEQGLLCCPGWSAWSSLQPQ 
CPRVKQFSHVS/LPSW*EYSCTPNTFSL 
QVCVSIHKYI*YIYIYrFKFF/CR/DRT 
LARLHRLVSNAWSQAILPPWPPKVLGLQ 


1648 


15549 


A 


1656 


189 


2 


VQPGQQERNS ISKTKNKTKQKLPTKKSP 

gpd*fkeelipxlhkl/f/hkiedkgtl 
hnsfywti 


1649 


15550 


A 


1657 


385 


13 


ggppp wgars ppkii *npppprkkppgpp 
pppggapplggfflffpppggpppgggk 
/s fsprffffffplggggqiflps pggg 
prgvfsppapplvpkpaffskkkkrgss 
ggekpeadgyfik 


1650 


15551 


A 


1658 


352 


2 


hlsllsnwdsirapp/rpcvifknvptin 
iffl*rgvtmlprlvlndppisasqvar 
iidvshwaklrrsv/cyvfetgsgslsq 
agvqrynhgsvqpqpsrvs*sshhslwk 
yryppprr 


1651 


15552 


A 


1659 


265 


3 


hsgqrdegrmrcxsewleshgwrarscm 
tlktsflftmaki*nqlkclfmdewikk 
mwhihtmeyysaikr/de i psfvatwme 

LEVIM 


1652 


15553 


A 


1660 


1 


163 


NQQNENRKT I EKI IGTKT * LFKKI KKFD 
KPLARWT/RGKKNIQITNIRN*RRDVI 


1653 


15554 


A 


1661 


56 


320 


KFFMYSAGESTKIRCIiF/SCLFLFLR/Q 
ESRSVSQDGG*SEPRSGYCTPAWATE*D 
SVSKKQNKNKPKKNIiNKCFASSIiLiITLP 
PARHKA 


1654 


15555 


A 


1662 


3 


421 


GI ITDTFPNIiEKGINIQVKKV/RRPPSR 
FNPKKTTSRDLI liCLPKIKDKGS * KQKE 
KTSKSARLPQPHGLLGLGLGKSASSPIK 
KERKQ\QITYSGAP1PLVTDFSVETLQV 
RREWHDVFKVLKEK/DFYPRIVYLVKIS 
F 


1655 


15556 


A 


1663 


362 


2 


VIFADEAQILKKEDKLDFIKMKTSVHQK 
TIiSTEYKDNHREKIFVSQISDKELISRV 
YKGLLKLWKrTDKNLILPyP*RLGHSVTQ 
GRVQWCNHSSLHPQTPGI.K\NPPTSAS * 
AAGNTGVHL 


1656 


15557 


A 


1664 


79 


355 


IHLPSL IGDFNLFTGI SLLI CLVIiFVF I 
FETGSCSVAQ/S/GVQWHNHGLLQPRPS 
GLRQSSHLSPLSSWNHRHGPG*FIYF/C 
VEIRAHYHP 


1657 


15558 


A 


1665 


47 


384 


KEKASGPLXN-FFFFPKLCKLAPPFFrjPF 
FLWGGGGGKFSRNPQKHPPR* KRVFVNF 
FFFFFFIiRHCLTVSl,SHCHPGWSAVA*S 
QLTAGSNFW\VKQSSHIjS/LPSSWDHRH 
APP 


1658 


15559 


A 


1666 


163 


601 


ifckggvlpccpgladlhfstsnsisfy 
yssgllrmtnktetpmstipkgvgvawr 
pgnse c i fqelpltlhhllstmlas f ih 
sheasanalvgrsltvgwgcrgvgvsvd 

PAAWLWRDLKGCF*DKSHSVTQSGGQWC 
NLSSLQP*APRLKRSCrLSLPRSWDHWH 
VP P FLANF * I FCKGGVLP \ FAQGW 


1659 


15560 


A 


1667 


418 


3 


SVCLGLPKCWDTGRKPLCPAPSFFY/EG 
SITLIPKSEMHLPRNENYRSGFLLN/M/ 
DAKILNRIl^ANCISN* r *ISrH* \*KVKFT 
PGKKDWFNNRKPTDI IYPH*QNREEKSL 
VS S TDMAKVFNKIQ P VLRSLT I E I KGNF 
LNL 


1660 


15561 


A 


1668 


411 


1 


LRIiHVGRTTT \ LFKAVSQGHLSLQRFLL 
PSVEICPAPRGGVYRGRQASLSCGGLHR 
VRASQLLWFLTQASAMAGPPPPVLLPPC 
SLI *DCCANNKGGFIGVGPFEPCVGYNL 
LVSHLLRPSEKPSIRVGVT*FSRC 


1661 


15562 


A 


1669 


151 


1 


PliEKEAE I TGP * PHAWL I F/ CFF * TSSR 
YIAQARMKWHNLGSLQPTPPGFK 


1662 


15563 


A 


1670 


1 


389 


TPF*KLKMINLSEEGMLKAKISQKLGLC 
\TVSQWNSKEKFLKEIKSATPEST*MI 
RK*NSSVANTENV*ERSRTSHNIIiFS * S 

S IQNKDLTFFNSLKAWRGQEVAEEKSEA 
SSGWFMRFKERSHLHTIK 


1663 


15564 


A 


1671 


1 


363 


ectgpkiakiilekklsiwggiipi/pnfk 
i*ykapvi*pwlkvpvipsssa3:lmktv 
*y\yfkdrnqdewyrlrvwkinshiygq 
li\pskgtktiqw*kslfnk*cwnwwlf 
tckrmkld 


1664 


15565 


A 


1672 


203 


2 


ALNKRTDMPS * * IRRCWI I / KCLFSPKM 
N*VF]SrVIT/lQCPSGHFFTETDKSILKF 
IWKSK*PRIiAKRTL 


1665 


15566 


A 


1673 


15 


378 


NYHHNQNNEHMYHSLPNFFFFFFGKGAP 
PG\ PKGGGRG*REP * I PGGKGNPPL* PP 
KEGGPPGGTPPPGKPGKIWDFWEKGGFP 
LSPGGAGTPGPKGDGPPWPPKGGGKPGG 
TKQPGPKKFF 


1666 


15567 


A 


1674 


2 


349 


LFLTVLARLVSNS*PQ/CDPPAIjASQSA 
EITGMSHRAQPRGXR* I *CGS YTTCHPL 
LLLT \ YLF I YFS TESHS VI QAGVQWHDL 
GSLQSQLRRLT/CILTLPSSWD*PHVTL 
PG*FCIiF 


1667 


15568 


A 


1675 


348 


78 


LGPWVPPVIPAPLGG*GGRSPRPGFLTN 
PGPK6KPRVFLKIQNLPGV/RGRPPLFP 
/GS PGSGGPKS P * PLGPS FPLT*NFS PP 
FWGPPG6PK 


1668 


15569 


A 


1676 


2 


385 


TSRRDYRP \GHHNQLI F* / S FCRDGASL 
CCPCWSQTPGIKKSSCLVFPRCWDYRCE 
PGL* I *MGKNPT\riFSNGL*CDCIPLIH 
SIADIRKKPHS *LQGL*LCHQQNSQTES 
CS VTQAGVQ * CDLGSLQP 


1669 


15570 


A 


1677 


386 


1 


KSTRPVLYKWNNKA*MTEHLFTA*FTEY 
FKATIETFCSEK* IPLKILLLICNVPSH 
PRALMGMYKEINWSMPTDITGILQPMD 
QGVISTFKSYYLRNTFDKAIAT/DSDSS 
DGS GKNLLKTFWKGFT I 


1670 


15571 


A 


1678 


2 


193 


EGGRI FFNS FFEVI LTLI / PKPKKWER 
K* S YHP I S IMNGDVKILAQTLLNQIQQY 

LKRI IHYDS 


1671 


15572 


A 


1679 


561 


830 


TLLLGT\NAWDFKLKPMLTY\HS \ENS 
RGL^KSWINLGLTVFYKWGTNNAWGDDR 
HHLVYRHGFTGI F * GSQLKTYCSENI PF 
KILLFIDN 


1672 


15573 


A 


1680 


415 


2 


TSCAWLSLYPVYLYRSSSLPRFIFCHFK 
HLWYKNITKKAAEINTNFCCYINRVLLC 
HLGWS/ATWSS + LTVTSKLLGSRDPHT 
LSLP S S * EGRC I T PRLGDLLNLFSRDGG 
GIRGTSTLPRQVLNAWPQAILLVRIT 


1673 


15574 


A 


1681 


1 


78 


RPRIRHEVGQAGLKLLTSGQTPASVP * C 
WDYRPEPPCPALHISYK*NHAMCGLKCL 
AVSA*RHVX.GFIRG\WHVECCFPFCS *A 
GLKLLTSGQTPASVP 


1674 


15575 


A 


1682 


414 


162 


GGPGGPIPGA/AGIiRPPPPPLGNPPPP* 
KAKICPGGGAPPVFPGS*KGGGESP*PP 
RGRGPFIGVQPLPFGLGHKRGLFPKKKK 
KI 


1675 


15576 


A 


1683 


378 


129 


QPFGPRNFYQIFNFPG/PPV*HSPLFGP 
KFPPPSPGGGQWGPLGNPRPPGAKGGST 
LRGPRTGGSRGGPPGPGKFFFFLKQSIiA 
Li 


1676 


15577 


A 


1684 


3 


374 


GISVLPGIGAPGNKPELFEEVKLYNNAR 
EREKYDNMAELF/AWKTMQALEKAYIK 
DCVSPSEYTAACSRLLVQYKAVFRRVQG 
SE I SS IDEFCRKFRLDCPLAMERIKEDR 
PITIKDD*GNSLS 


1677 


15578 


A 


1685 


2 


373 


PFIRPETIKLLEE I PGGKLLDLGLGNDL 
LARTPNAKINTWDHIKLKSFCTXKETIN 
TMKKQPTDCEKIFSRriISDKGL.M/SQMC 
KELVQLN*KK/TDDSVKGWAEDIiDRHFS 
KKDIKMAbTRPGKVIi 


1678 


15579 


A 


1686 


378 


201 


HATCriANF\CSYG*DRVSPFCPGWS*TP 
ELKRSTRLGLPKCWDYRCLGRSIiliFPGA 
PPDH 


1679 


15580 


A 


1687 


1 


404 


MCWPGGGACSEPRSCHCTPAWVTERDS 
V*KTIIKKKVKRRKKIFHA3SrS /KLKRVGM 
AVLISDKI YYKLKKVRRDKEYI IKGSTY 
QEDIS ITNTYTLNT* APKYMKDTLTELK 
EEIDIYAIIVGDFNTIjLLIICRGT 


1680 


15581 


A 


1688 


307 


33 


DEGSCHDAHAGLKLIiDSSGLPASASQSA 
GILGVSH6ARPLISSYNGTSHGGLGPAL 
VTSFNLSHLFKDIiI,S]:iQI/HVTF*GMGL 
GFAGAKLSL 


1681 


15582 


A 


1689 


11 


394 


IFILEARTRISSRTFFKS * INIHi^YNS 
AI * L I GI FPREKK/STCP YI YTQMFIAS 
IjFVTAQTRKQPKCPSTGE\WSKNLWN 


1682 


15583 


A 


1690 


20 


391 


SEGKGWYSCTKWQSMKLGGITFFFFFPP 
PKNPPPKKSGP/QKGPFF*GKGPPWPPK 
KRGHKN\RGFPKQRARPPKPVFFLIPGK 
RGFLLGPKGG*N"PGEKRN"PPPWP*KGGK 
NPGNPKGGPHLTL 


1683 


15584 


A 


1691 


72 


392 


IKMIGSLFFGFAFFFFFGKKTPFFPQPK 
R\GGKP* IT*TPPPGN*RNSGPSPPQKV 
GIKAPPPIiPK\NF * FFGKNGVTPFPPGG 
FEPPTPKEPSPPVSPKGGKTNPAP 


1684 


15585 


A 


1692 


389 


161 


HGGACLRSQLLGRIiRREDCLNQGG*GCS 
EP CTP AW VTE * D / S S QKNQKS KI KKSGL 
DNSFSIG*GILGLSTCDRYS 


1685 


15586 


A 


1693 


286 


363 


DGI Sli* PRPPGLRQPSQPS \LLSNWGCR 
STPLCLAGFFVFVETGFLHVAQACLRQG 
FTMVAQS / AS QK I H I T * GAFE T I QVL F Y 
WGGVGFGQWRFFFFFETESHSVTQAGVR 
GCSLSSLQVPPPG 


1686 


15587 


A 


1694 


1 


356 


ELLEPRGRGCSERRPCHCPPVRVAEQDS 
VSKKKRERKYLYFLRFNWRSLRIFIYFC 
* /HPLQHNIQNISFTIiQNSFGFFSRQYC 
PS PLE I I FLTPLTENLLGLFMKGI IQNW 
FFGVGLF 


1687 


15588 


A 


1695 


3 


298 


KYFETNENKNIQYQNLXAVKLVFRENLT 
VWACVTCKEERFQV]SINLALYPKN*EKSM 
IiNPKGKI IKVRSEKNDIE*KNDEENQ*N 
* SWYFEKI TYWQTLD 


1688 


15589 


A 


1696 


3 


405 


RLWCGWRNRHLGS *NRVENPETGLHRYA 
QLI PLTKVQKQVGEGQPFNK* CGGTWAP 
TGKT/MEQPPKASSSSSSSSSSSSSSSC 
KM*NIVFFKMGENLWDH*AKSYEVRTKA 
* TI KGKVDKLDFI KIKHFC YGKN 


1689 


15590 


A 


1697 


6 


392 


LQRTLIiVGLFNAAGNLKLKPMIjICHSEN 
PRALKNYAKSTLPVFYKWIKIAWMTV*T 
VAAWIiTEYPKPIFENYCS/EKKIPFKLF 
LliTmAPGHPTGLMEMYKTTNVGSLPAN 
TTSVLQPMGQRVISTFKS 


1690 


15591 


A 


1698 


390 


3 


AI lESDFLTTSREVAKKLRVHPFMVLWV 
LEP IGKVKKVNKWGP * KLNK/NKKNHCI 
EVSSSLFLCNWEPFLDGIVTCDEKWIIiY 

HisTw* *saqwlnrevapk;\hflqpnlhqk 

KVTVI VWWSAAGL IQNEH 


1691 


15592 


A 


1699 


1 


245 


gggge ys ki lai ktalknt t ylg i yl i k 
\dvpdlytknygtmlreikylek*rarp 

CS *TERFKI VKMS I /LPNL I YRFNTI 


1692 


15593 


A 


1700 


2 


324 


gtsgtsgtsgtsqxcrisrvrstsswts 
GRHLQCHVSKAES S WPVLSAS SCSTGS P 

ISLSGS/GSSLSNSSKNLTLPCLPSSLL 

S*P*A]>TPVDSAFKIY?GLSSVGSCL 


1693 


15594 


A 


1701 


183 


372 


PGVLLCWPGWSAVTVHQCDHSALQPRTP 
RL K* S P \ S T WD YR YTP LCPA 


1694 


15595 


A 


1702 


29 


382 


GKRFPCSWEPKFPLTQGGFPPLHPGQKK 
KPGSKKKKDKSKCW*G/CREA/GTIiTHy 
RQELEMVQSFWK/TVWQVLRKItDTVEIiPY 

dpafllglhptstqkrXdtmfma/aitl 

/ISERWKQPRCTS 


1695 


15596 


A 


1703 


1 


382 


KKVKIIGEAAVEFPDTIKKIIEEKEYLP 
L*VY\NADESGLFWKKKLQRTFISKEEK 
\SMDRLTLI ILCKCSWVYEQDGP * ALKE 
KGEHQLPVF*L*NKKAWTVRTLFLD* FH 
QCFVPEVRKYVASKRL 


1696 


15597 


A 


1704 


2 


330 


KLNNLLLNNS *A7NTE 1 KAEVS S SLLE IN 
EYEDTTYQNLWDAAKAVLKGKHVAPRHF 
LQEVKK/RLKRFQINNLTLYLKEL/EKE 
HINLKASGRK*MTKIGDLFGLYFVLNG 


1697 


15598 


A 


1705 


100 


342 


APKWSIVCPELVGSWSH*/PSRMKPWTL 
TRQGFTMLARLVLNS^RRDLPALASQSP 
G I TGMSHRTQPLL I NLME I PTE I LS 


1698 


15599 


A 


1706 


600 


211 


SCSVARLEFSDVIKAHCHL/RTPGLiKQS 
SHLSHLSS\WD*GRVPHDLANF* IFCRD 
RVLPRLLQAGLEL\liASSDPPS *ASEKC 
WNYRHEPTVPRQNLGLLKTTYGWVFLKK 
YILiTVSVFSLMCLDHCLLiyi 


1699 


15600 


A 


1707 


409 


1 


RGPFPFS PGKAFLKIiGSl>jrS FFP I KKTRA 
PKKPIFSPVSP/LNFPPKTGFPPVFPPQ 
MGGFFF/CFPSFLFFFLPPPPPFFFSFP 
PPFFFFFFFFFFFPLFFF*DRVSLCHPG 
WSAVALSQLTAALTSMDSSNSPTCV 


1700 


15601 


A 


1708 


154 


2 


IGKPIjAGLTK/RKRENT* INKIRNEKGD 
MTADNTE I QS I IRDFVS * RTAHQ 


1701 


15602 


A 


1709 


263 


37 


saqlhplniqnhrqsillhdfflkkqd/ 
g / wpgagahacnps tlggqggwi trsgd 
qdhpg*hisv 


1702 


15603 


A 


1710 


390 


42 


YAGGFRAI FFPPLPREGAKNPNFP VGPP 
PFGGPFFFFFAPSQKKEPGFFGKKGVF* 

ggakgfpnargpfff/g*kkkkgpknkt 
pgffl/mgppnpggppprgeggkigakk 

KKKSIRL 


1703 


15604 


A 


1711 


3 


167 


YTCVFVCLCLCDCMC/ CVCACMYI CVCV 
CTRVCM*VCVCMCVRVCVQALTVLCKSV 


1704 


15605 


A 


1712 


116 


391 


krnfffgpqgggegpkfn*rgppppgvk 
gifppsppeg/gkkkgappppgiifwff 
kkkgvppcgpggv*tpdpggppppgppk 
ggaprqgpl 


1705 


15606 


A 


1713 


401 


47 


HHYATKPFTHAHTCTCIQDTCNTCMQHT 
QVHT/HTDTHTHTRKVSSVCVIalMAEQE 
RPCPHC*GGEAGAECGVCAWLGLSS *TW 
RNRKGPAHTVRWGQEQSAGCAHGLVFP 
RSYLWT 


1706 


15607 


A 


1714 


1 


400 


CVESCEVDIEMVSCCV/CSG*SAVCSGT 
AS AHCSLP I PGSRDS PASACQVAGTTGM 
PH/LYPGVPLKPREGLQFTELPSGQIiEI 
QPTCENKR*HVPCAL*VQIiTDIRPR*RY 
QFRVAAVNVHGTRRFTAPS KHFCS 


1707 


15608 


A 


1715


413 


3 


LNTFEPQRSPFIVUSSEMILIiVECLFVT 
SGWIYHERFLNPGREIDWATCYSETGPC 
PVTQDGVQWSimGSL*PQTPRLK\HPTT 
SVFPVA/RQGL/NSVAQAGVQWHDLGSIi 
* CRLPSLKGSSHPRLPSSWNYRYAPPPC 
I 


1708 


15609 


A 


1716 


421 


144 


RLECSGGITAYCSLNLPGPSSSPASAS* 
VAETTGLLHWKKKIVETGSHFIVQTAPK 
FIiDSSDLPALAC/SWDYRSEIiLCPACFY 
NFCLFINIPC 


1709 


15610 


A 


1717 


3 


384 


YSACVCVCVWVDVSVHRCMSISGH/ARI 
IRVGPAHKAQSSCRPVPAGCCTSAQTPP 
WCSAATADPPPR*GESLPGAYPGSHSTC 
CPGCCLDW*HSLPSTEPAACRAGAPGGG 
H*AWLGCGGRAGGRPG 


1710 


15611 


A 


1719 


3 


615 


PVGSWARSSGAGWPPGSPQSVSDGEAGH 
S IPAPRGQCSRHRAEGRTARVCLSHCSF 
SGPRPGLVPIR*SLGRPDAVQAIVPDSQ 
EGRKTGIHAEAVMFPLGPGKGVKGCAGG 
RIaCPSSPPSRCLDLGRRGMPPSSGPAGP 
RPSGVGSDLGRPGAGAATS SSSSSSSSS 
SSSSSSSSSSRDGEGP\TGSVEAPGSLL 
GPWLPSQPS 


1711 


15612


A 


1720 


320 


3 


GLKWLWLKDNSL YL r LAKWDDCLDEK* 
YADKMLQYTKSVWVSRSRGCQGQVGEKK 
QATEAAQEWELRK\RtiYWRKECDALRAA 
REEQKELRDVRKAKKVYVCVRV 


1712 


15613 


A 


1721 


44 


373 


KAMGQTLWKTVWQFLTKEEIGPM*QSCP 
T/DLAIiVFLGICTIDLKAYIHTETCTQM 
1 1 ITLLI I AKNRKKAS CSS VGE / WNKKL 
Y YIRTMES YS SLR * WELSS YKKHWGGG S 


1713 


15614 


A 


1722 


135 


396 


AQGLFCTSVKLASEQPLRILFQLDKRNK 
\ FEI YGTSG*L*S 1 1 1 CQlSTNIiQSKFQMY 
HHKIMS SLGAVAHACNPSTLGGQGGQI T 
RSGD 


1714 


15615 


A 


1723 


4 


383 


LNLRAEAIKLLEENIGIHLPDLGLDDLF 
LDITPKSQATK/AICIGM*GFIKLKHFCA 
AKNI IKKMKRQYKEWKKIFGNHVSDKKL 
VFRIYK*HLPLIIKNSSS 


1715 


15616 


A 


1724 


2 


405 


NSRTSLILNQNLE/IKLSEKGTWKAKTG 
* klgllqqkX ISKIANAKEKLLKEVKSA 
TSMNM* iyiiyiRK*NNF I PEMQKVLWISJ/ 1 * 
NX /PIjCQSIjIQNKALTIjFNS I KAERGEE 
A\KLEATKRWFMRFKESCLHNVKVQDEG 


1716 


15617 


A 


1725 


90 


400 


sqllrrlkqnslnpresssktttrtkvs 
hwhknrhvnq/ynkiensginlhiyg*l 
tlnkgdeasqyss /dilifnkwcwqkkkk 
\yldpyltpctkisstwtisgfl 


1717 


15618 


A 


1726 


390 


1 


TFFPKI \EKKIL*FIWGPRRPKIANVFP 
*QNKPKIEGIPLPGPKIYYRALVTKTAW 
P*HKNPPIGQRNKVENSETNFHPPSEUfT 
FFPF\YKGAKNIHWGEDSLFNKWCWENW 
I S I *RRMKXiGPNLTP YTK 


1718 


15619 


A 


1727 


3 


365 


HASAKSmiRWIIKIilSTLRAKTIKLLEENM 
GENLWDLEIiSRGFriDRT/PKLYSIKLKI. 
RK/ WNF X KI KNFCPSRDT* KS *MAGKNI 
LSGKDLYSEYLKK* CNLIIRQSKTSr*M 
FIAALFIIAKH 


1719 


15620 


A 


1728 


368 


2 


RNCLNSEGGGCSEPRSRHCTPAWATEPD 
SVSKMK*NKKINK*IMFLY\EQQSETEY 
FKIiPLTLA/SRNMKFLGIWIiSKGVQDIiE 
TENYKILIjQEIEEDDNK*RNKTY*WIKL 
NTVKMKSILSKR 


1720 


15621 


A 


1729 


326 


30 


NPFGGPKKGGSQGREIKPPIiPPMGKPLF 
F^KTKPTKWVGGAPPVIPPS/SGVXiSQKK 
TFTLEGGGPNKLNSPPALP/ARGPKKNF 
FQKKKKKTKQWPSQREE 


1721 


15622 


A 


1730 


1 


374 


I FNADKI A\ FWKKF * KPQGTS VGREEKQ 
APGFKAGRNRLTTI/L/GASAVGFMIRA 
AlilCKAANPQVLKGRDKHQLPVFRLLYN 
KKAWTTRTLFLDCPHQCPVPEVRKYLAS 
KGLVFKVLLILDNGPC 


1722 


15623 


A 


1731 


389 


1 


FPPKIFFFSTLFFFFPRFFPPPPFL*P/ 
SPPIYFF/CAPKKKNFFPPPPGKNFFFF 
KTPPPFFFFFFF*D*VCLCCPGWSAVAQ 
SWLTTTSVPRVPVI 


1723 


15624 


A 


1732 


118 


422 


DITTHLFKWLKFKK* BKGLNILFTKBDM 
QMEKNLKRCLT* FWKELQIKMRYHYPP 
IQiVmKX * KIT/STIS lAWQGYRTIGTLF / 
HC**EQPFW*FLSKIiNMILPYNPA\liyiL 
LS I YPNAIiKKHVHTKTCM * MF I AALF 1 1 
TKNWKEPRCPSICEW 


1724 


15625 


A 


1733 


407 


1 


NIKGPFRGPLIQWGLLIWPKDSFPILGY 
P PFPS PK I S FFFLARS CSGAPNHFPLPN 
QSPCFPQPSFFLGEEKKEFLPGYSIiAFP 
FN/RFGGS*RVRKGNGGPIPMGESPLFP 
F * DRVLLCCPGWS AVA* SRLTAT C 


1725 


15626 


A 


1734 


322 


362 


TAIYI YI I *LiFTDNI PSHPRTWIEIYKE 
INI F VPANT I P I VQHRNQGVI CTFRKT I 
TVTDCDPAMGSGQSKLKTWKGFTILDAV 
KN/IRDSWSKVKIG/TIN/GVRKLIPSL 
KNDFKRPKT 


1726 


15627 


A 


1735 


49 


395 


RGGPGFFFFFFFFSKKSQ I LPPGWKGRG 
EPRWGTPPPRGKGNPPAQPPQEGGKTG 
PPHKPG*FLCFLKKKGVQKGG*GGPQ/A 
PGPKGAPRPGPPKGGEKREGPPGPTRPN" 

LYYAH 


1727 


15628 


A 


1736 


417 


2 


FLFFFFFLFFFFSFASGPEILFTCL*HT 
H I HFLPF YS KST * * P PVFAGGMFQDPQW 
LiPETKMVPNFKKKRTTLT/YIP* *KLCE 
CDIiSNFFGFF*DRLIiLCRPGWSAVA*SR 
LTATSTFQAQANRTRG 


1728 


15629 


A 


1737 


316 


338 


FFFLSPYFETESYIiHHP*GFIVKLSKVK 
DIEN\LKKTARGNYQVTYKGAS IRLAAD 
FSAEISQAWREWDNMFK7LKEKTNWQPR 
lYKTLFVPHF 


1729 


15630 


A 


1738 


197 


379 


QKRAQ IDKAIF I CRD I AL/ P * FQ / MYYW 
ATVTKTAW* WYKNIiHI EQWNRSW \ PPE I 
KSQSYSHIi 


1730 


15631 


A 


1739 


4 


401 


RGYRHAPPSIJ^FCIFSKD/MGFTVVLN 
S * PQ/CKIiPASTS * SAGITGISHCTRPQ 
MATFLIGPHKIIPWSVLWPNL 


1731 


15632 


A 


1740 


94 


117 


KDRPMVPPVGAGEDQADEPCRGHASLWS 
QLVSAPTTP I PLPGRDVPSRATPPPAAL 
AAQQPP * AS PYPLPPGLGAGHAS AS P VT 
VPFSPISESTGS*ESAL/PAPRPGGSG 


1732 


15633 


A 


1741 


3 


390 


VDQFLISHDLSKLTEDEVHNLiNSSTTIG 
\EVEFKVKKL\ * KKKSSGPDGITGKFY* 
TVREEVTPIL/SYLF/HEIEKEETLVNS 
FYE AR 1 1 L/ 1 PKPDiCNKP * TWIDAKTTS 
KVLPNRIWQYVKI I IQHN* VGFV 


1733


15634 


A 


1742 


3 


442 


dlhsrveprvrpsvrkq*wllkvlica 
skdtlkrakrqpigwekifvnhmpdkdl 
i pe ykinmqksgvflftnns ys sttdnd 
inkwakdlsrhfseediqmankhmXkr* 

SVSIjVIREIKIKTTGR*LFTPKCWQG*G 
EIITIiVH 


1734 


15635 


A 


1743 


411 


2 


DPPFKHPPPEIILGAPKKKITLPPPRPK 
KCISLKGPPFFFFCRYRVLLCCPGWFST 
PGLKQSSHLGLPKWWDYRHEPYCTQSSP 
SLSFFLKQTGR*WFDLSNFFFY/CYCFI 
YFSRDRGtiTLLPELVTjNSWPQAILL 


1735 


15636 


A 


1744 


1 


393 


RPGGP PKGRSREQGREGERSRRRRPRAP 
WARSHMWGARVFSVPRSFDQRPQEKCVQ 
SSYKQQLPARARDGTGNLIRGAPLFFF* 
DGVLLCRPGWSAGFKQFSRLSLPSR*DY 
RRTPPHPANF*ML / CLRRSLT 


1736 


15637 


A 


1745 


395 


0 


PSAPSFFSTRL*LGEPPGFPPPPFLKPP 
PRN/SIFGAPKKKFFLPPPRGKKFVSLK 
GPPLFFF*DGVLLCCPGWSAGFKQFSHL 
SLPSS*DYRRTPPHPDNF*MF/CLRRSL 
T 


1737 


15638 


A 


1746 


397 


1 


CGNFLKREKNFEARRFLQK*AARFRNIR 
*VTPEITAFCPL*HVASPLVFLTPNFPT 
IPQLYCIiEPLGEMGGSGSKLPPFPTSKT 
PNPLISVlSrLCFP\AIKWENFFF*DGVLL 
LHPGWSAVAQSRPTATSTS 


1738 


15639 


A 


1747 


392 


2 


FTKKGGRGGNLS P / LPPQKI F * KKKTLK 
KPFFWQRVWFPSPLFFEQKGQGPFFFKK 
PFF^KRPPDTPQPPPPSFIFFFLLFFFR 
/HLVAQAGMQWRHLGSPQPPPPGLTQSS 
QLELPHTPPHPDNFCIFGRDR 


1739 


15640 


A 


1749 


33 


403 


IKGFKKKRGKGGAPQKKGGTGAKPPPPP 
PQGF*K*QKRKNGSPPNVPFPNPGGPPP 
PPPFWGDKRGGPPRGGGAPPPRGKRENP 
FFPTLA/HGKKEKKK 


1740 


15641 


A 


1750 


3 


396 


KRQTTNWE KVFA* KNX ADGL I SIj I RRKC 
LKVKK/W*R/DiyiNTQFTDKGILMTNKHM 
KRSSTS L I MEMQHKAGVI FHPS DWQKC* 
STDNTQS*QGHEEMYTIKHSW*TPDYQQ 
PF/SESNL 


1741 


15642 


A 


1752 


30 


419 


NEGIGAGHEVSFAANKCRSMRKNVEVLA 
LlSTRRIiDGELLSGIiTSTQALPGWAYLHLL 
SHHAVRPLFLCFKRGWVLLCHPGWSAVA 
QSQ\FEI,LGQVItjRPHLPSS*DYRSIPP 
CLANFKMFFRERARYACR 


1742 


15643 


A 


1753 


16 


410 


VGPKKSLQQVWAAVQATLPLESYDLAHP 
IILKVSLADRDAI *NLWQIPIVAS*YIP 
LGF/ YSKAMPSSVD r YSS FEKKKTHFFL 
TGGGGPPLYSNYIiGRLGGANHLTPGVKN 
QPGQFGKPPPLQKVQTLAGRG 


1743 


15644 


A 


1754 


2 


17 


NSSLIKHRRIHTGERPYQCSECGRVFNQ 
NSHLIQHQICVHTR*RMYI*SR/CGKDFT 
QKSTLI*H 


1744 


15645 


A 


1755 


138 


380 


XPXXXSPXPPPCPXPLSSLFLTKCSAY* 
TPAHRPPPPGPFVPPPKPCCPPPPPLRP 
WPPLPLLTNPI PPPTLSFHPGPAIiS 


1745 


15646 


A 


1756 


194 


3 


WLCIPIRHTTEQQPGFPF/LIIWSQPYS 
PLFCLF VF* DTVSLCGPGWNAGVQSGLT 
AASTSQAPSL 


1746 


15647 


A 


1757 


2 


403 


RVLFSPTIAYTYLLFYLGATILMOVNIW 
KQPNCPTK/GQQKIKLQYIYRMEYYSAIi 
KBCK* IIiLFAIR* VULGDIMLSEVSQXSR 
KKNIVLSHMW 


1747 


15648


A 


1758 


398 


65 


PFFFFFFFPP*TKGLGCIHRCDHGSPQP 

rihXgssnapslas * VGGTTGAPHHARF 
TLIQSSSVHVSTHTLHPYPSSLP 


1748 


15649 


A 


1759 


456 


31 


FAKRITDKKLLSLIYLAT*KGEITKAIiI 
KQ* \LKAQAKNSLKRHTI VLNHMKL/ CL 
ILLI1KEMQIK/STLRYHFFFITLAK\I 
*KLGNTFCWQGL*GTLIHCWWECK*HNS 
YVGG 1 WQ / PSNKL YVQ I * YNS AI SNLDG 
RVGRPG 


1749 


15650 


A 


1760 


3 


378 


QFQ YF Y YNGSVI KAW YWS KNRK\ IDQ * 
NRTESPDKNIiHKHMQLIFDKGTNTPQWR 
ICDDLFKKWCWWN*TSTCQKKKKKKGGGP 
* KEQNLTPPGWEDI I FLFGAPKNMPGAG 
VKTRWGGKNPGFPQ 


1750 


15651 


A 


1761 


69 


384 


YTSAS WGGARYTAS AAGWKTLLLLFLF I 
*DRVLLCHPRWSAIiTQPRIIAASTFQ\V 
KQS F WLMH 1 GD WD YRRCMP P CRANFFNF 
LCKKKKTIiRRQEVWQTPALVRV 


1751 


15652 


A 


1762 


390 


1 


KFSTPGMKNLFFLKAPPFFFFCRGRVLL 
CCPDWYSTFGLKQSP/CFSLPKCWDYRR 
ESPPQAFFVFFGLPSLPFSPPPAPSLSQ 
SSSFFF*MSFHSFAQAGVKWLN\liFGSL 

QPPPPGLK*FSCLSLPPTRP 


1752 


15653 


A 


1763 


2 


390 


PRVRGFFMRKFVDSYLVPYTKSNLKWTI 
DLHVRAKAINLLN*NVRIHLYDRELCNG 
FliEM I KKTKAAATTTKLDF I Kl KNFCAQ 
QMSS /MKVKRQSTKWKKKCSYHISDKGL 
VSRKYViCKAYNSS IRTQSH 


1753 


15654 


A 


1764 


334 


3 


WSKRSGPPrjSKNQTKKKNATKPQT*KNW 
INEIGPI I /NTSPSKEKTGSNGFTD*FY 

* TLKEELLS ILLKLFQKTSKS VI FPKS F 
YVKDHS S CLSGIHPKDAIMMEQHMQ INQC 


1754 


15655 


A 


1765 


259 


1 


KSTLFLMKKYWKGSFLKKHFFLETRSLS 
D/SPG/GVQWRDHSSLQPRTPGLKQSPH 
LSIjLSSWDHR*APPCPANFRFRLKTRIG 
RDV 


1755 


15656 


A 


1766 


402 


386 


FKKKSRLRKI / KVQDEAASTPDLAKTI / 
DYEGHYTKQQIFHVDERAPYWKKMPP/R 
TPIARKKKSMPGFRASKDKLTVLSGANA 
AGDLKLKPVLTYHS /ENPRALK/HYAKS 
TLPELYKWNTKAWMKIQKFPS*FT* IFM 


1756 


15657


A 


1767 


2 


406 


PRVRPRVRKIilTLLNVSQRWSSEKKKKK 
KKTKKKKKKKKRGGGRL,*KKKKKKPRRG 
RVNLFFWGPKKSTPPRVFKHRGGEK/PP 
PPPPKKPRERKPSLGVGSTWEGISPIKH 
SKKTKSTK 


1757 


15658 


A 


1768 


14 


409 


lASGFLFFIYFGVIGRPPKRGGVFFPGG 
GGAPHPPQGGC*KKKRGGGGGPPPFFPP 
SLGGLGRPRPMSKIKTPPSAGGKPP/PF 

LRNQN*TGGGGPPQ*PPLFGGPGGKNFS 
PPRGGGGGGPKTPPPPPPGGE 


1758 


15659 


A 


1769 


305 


1 


TKTGPFPSNRAKKKKIVFQKKKKEKKIN 
CFVTAPFKGIKTEATD*EKIPAKHLSVK 
G/LVYL* YI * ISKIYKELLKLNNEKTTI 
P IKK* AKDLSRHFTRYTD 


1759 


15660 


A 


1770 


148 


402 


FSPVSLG*GRGNIYAOyiSNV*EVPPEID 
FQHEVKRALQTSFQVKLVKI IFFKSTI * 
KSLAK/WtiAWAHAC!NPSTLGG*GGWIT 
RSGVKSEPGQHGE 


1760 


15661 


A 


1771 


250 


2 


KKKKEKKIICFVTDTFKGIKTEATD*EK 
IFAKHLSVKG/LVYIj*YI*ISKIYKELL 
KIjNNEKTTIPIKK*AKDHSRHFTRYTD 


1761 


15662 


A 


1772 


407 


1 


KKIRRGGGQPPIiFPIilGGGREKKFFFPG 
KRGFY*TKAGPPPSSWAKKKKPVFKKKK 
KEKKI ICFVTATPKGI KTEASD * EKI FA 

KHLSVKG/LVYL*YI*ISKIYKELLKLN 
NEKTT IPX KK* AKDLSRHFTR YTD 


1762 


15663 


A 


1773 


1 


406 


KKKKKKPTTFPGIiPFFFSLSLPPRVPGA 
RGPPSSFPSKPGLGC*IKKSGVSPAPKK 
KKPRKTNPKNFPRPNPKGGGEKFFNLPT 
HPPGGTPPFF*RRRENPPGFAPQKG/EV 
FSPGGRNSRGEPRGGPKKKKKKGG 


1763 


15664 


A 


1774 


2 


378 


AAGEWLHQGSLQSLPPGLKQSAPLGVSK 
W*ISIPWHDPPPPAPRFVWWGGWLRWS 
FFLGAQAGIFFFF* lEMGSHYVAQAGLE 
/PPRLQGSS*LNLPSSWDYRPVP 


1764 


15665 


A 


1775 


1 


431 


QQMRDKRNIjFEHNK*GIRGI YLNI IKAR 
HEKPTVDTILSGESFSSKIKTMIjISPFL 
FNTVLGVLA*ARKRKDI * VGK* EVKS YM 
FTNDMI\LDNPKDSTPKKKKTGYFMGGP 
GSKPPPQRGGAFLSLTRDPLEREFPKTA 
LFTLGQKKIKGPKFFS 


1765 


15666 


A 


1776 


334 


402 


KGGGGVGG*QGPWRIjAHCTDK\KEERKR 
ERERKRQRKKERKKERKKEKKE *MKIS!NK 
KKNDK 


1766 


15667 


A 


1777 


406 


3 


SPSSSSSSLFSPPPFPWGGPRF/SPPPP 
VFKPPPPFFFLGPPKKKFFPPPPAV*FF 
FF\LGPPPPFFFFFFFWEAGFPFFSPG* 

GP/SGPMAGFRSLPPPGNSIiSKKKKSEG 
LGEGGNS VIiTR^^LL ZSSYQl PGNPR 


1767 


15668 


A 


1778 


70 


409 


LISFLVSSLIVRLYRPLLCLFPDPSKGH 
CIPCIiLPLFWILLHIPLFMHSFIYCLIN 
DRVLLCLPGWCAWRPRLTAAS ATQ\ I K 
RSGSHLSL.PSSK* \ WDHRRCPGFFFFFF 
FF 


1768 


15669 


A 


1779 


390 


31 


SHLS LPKCWDY/RL * ATTPSQKI LVFTH 
G*VLSLLSLSFLIPPD*TF*KMSLMRPS 
LTQKS S 1 1 QHDLLDKVPITI FI.RQSL/D, 
S / VTQARMQWHDLGSLQPQPPGLKLSSQ 
PQAL\SSWDYR 


1769 


15670 


A 


1780 


357 


1 


LTIjFWGAQYIjPKKGKGPFFLLSLPSPSV 
WGPLPQKKKSPPLCFPYFLNRVLLCHPG 
WIAWQSWHSSAHFSL/VLTRFK*SSCI, 
SLLS SWNYRCTLLHPPNFLNFWYRQGNA 
VLPKLVL 


1770 


15671 


A 


1781 


122 


254 


RKNE \WSGAVAEiACNPSTLPGPGGQIMR 
SGDQDEPG*HBPPRPTDSFFIiYPSH 


1771 


15672 


A 


1782 


45 


387 


TQTPKLRRFFHFSLIiKSWHYRCSPLXPQ 
HNGYFLTNLDIATSMFFFKDRVSL\LAQ 
AGEQWCDPS SLHPQTPGLRR\ PPASASQ 
EGETTGAHHHTWRNLIYFFYYTYKF*GT 
YPMA 


1772 


15673 


A 


1783 


1 


401 


FATLARIjVSNS*PE/CDLPASASQSAGI 
TVSPAHIiAISPSFPAMPSSGIiGLSYPAY 
HPGLGLRCHLPVLTSPWTSDTGPSSVIiP 
DAGALHCPPEPQHICPL\LSGWLQTP 


1773 


15674 


A 


1784 


432 


1 


FLFLFFFIKKKTPFFGAKPKNRGF*KPP 
EFFQIF/S/CPVPLGRFPKPQKEFFFPE 
REMGFPFLFSPQGGGPARGFPPPPLNQG 
EAPRAGHKKKGPNLGGRTFFFFPFEMES 
\FSVTQAGVQWHDLGSLQPLPPPFKRFS 
CLTHAS 


1774 


15675 


A 


1785 


15 


434 


RLSLSCCGREEHSTIiPGAPWRCTEIAWA 

DSPDPAPSPPSALPSLLPFHVYRDVCPV 
LCLRGWP* *MVERGRLGISPTWLLGWPF 
PGGA/PHIKPE*YFIjFAYTILRSVPNKL 
GGVLALLIiSILILtAIIPILHISKQQSII 
F 


1775 


15676 


A 


1786 


1 


258 


CWPETPVIiK* SHLDLTKRGEDRREP VWA 
ASTTIFFFETGL/NSGAQAGVRWVHLGS 
LEPLPPIPSLFMTP*GPCPGLGSSWPLR 
EF 


1776 


15677 


A 


1787 


399 


63 


SLHNQWKSTPTLKTSKKISHI *STWPG 
WAHACNSNTLGHHGGRTA* /RSGVQDQ 
PGHHSETSSQRLRNPYIKRCIKYLAHSK 
CCIMDSFTVSVTSRKLIGKREVSPNNIT 
FR 


1777 


15678 


A 


1788 


3 


474 


MS I S PVRWNS KEAGRAANRQFPS FSPWK 
DDSRDAS PPEPAS PTIG\ P IRRLAES S W 
TWGSPCAEHPRARAGRRKAATDCPWAAG 
SQWRGPAGQGAPRS CLFPGSRTAARAQH 
PRVAPPPPPAPLNTRASALRSQLPRNPL 
* VMTPRPPAAAPRS P VGP 


1778 


15679 


A 


1789 


66 


395 


LiVQPLFDFIWHSRLSLWSGRDLLWSGKE 
TMNPCLNHSIGVLQEWQGSDVKKRRRLM 
ESLTGPAADVIRILKSNNPAITTAECLK 
ALEQVFGSVDSSRDAQIKPLNTYQWP 


1779 


15680 


A 


1790 


413 


1 


PSPRAIjITDYS*EEGPRFWQV\EKKGQP 
LKPHPGLGSPHQESPRVGPPKRGGYNPF 
* KKRAKFPAPGENKGPPFWTGRDPTF * G 

KTKKKTNTPPKKGGKKGPPPKPGQFFFFF 
LRRSIiAWSPRLECSAAISAHCKLRL 


1780 


15681 


A 


1791 


314 


1 


KTKPFFIjKKTPPKKKNKKKGSC*DKARY 
QTRKG/ IVNLGH/HPSFLFYFILFFETE 
SHSVGKAGVRWLRANSLQTPSPGFQQFS 
RHSKPSRKDYRHPPKRPQECVQ 


1781 


15682 


A 


1792 


104 


409 


BKQSPAFDWHFDIiYFDNVK*KEGBBSKA 
GEFNASTGWFGNFRKRL/RFKNVRVTGE 
PAS VTQEAADEFPDNPKKI TEKKGYLHG 
KFLMYHEAAKYLNFWCPTK 


1782 


15683 


A 


1793 


392 


2 


GERERDRL*REREREREREREGERERAR 
ERERARQGTSTVESRF/HSYCRRDQDVA 
RPKAKGE VAAGRGS PDGLQVGRGQ PGP 
SLRPGPWREWGFATYIxACGGP I PTGEVE 
SGEEWAQQGEWGQALGPP 


1783 


15684 


A 


1794 


405 


119 


IIKMAIVPKAIYRINTISIKVPMAFFTK 
\LKTIIKFI*N*KRARIAKAILSKKNKA 
GS IPLPDFKLYYKAI VSKPTWCWYKNRP 
IVLVHSRTARKK 


1784 


15685 


A 


1795 


455 


3 


CSVTQAGEQWCNHSSLQPQPPGLKWS/C 
PSQLPK\WNYRHVPPRPALTAHPALTAD 
F*RRKEyKliMRHRGKKWHDFTLRSKKMK 
AMRHEYHSFLTYPNGHLYLHWQFFFFDT 
ESHSVSQAGVQWHDLGSLQPPPPGFKRF 
SRVGPGGNSGAD 


1785 


15686 


A 


1796 


2 


134 


PRLQHCTPAWATPQDSVSKRKRDMTTDP 
AX*KG* *ATTTNIYIHKFYNLDEIDQFL 
KKHKIiPQLTWYEI /DNIjNSPITR\rEFV 
IIjNSKKKYPGSDGFTGEF*D*PCLETKK 
RYD YRP CNLKRI I SD YNKHLHT 


1786 


15687 


A 


1797 


1 


404 


PTRPIiTGSSASGMMVEIFPKTYLISSAC 
WVKISKIDLKLFSPFFFFGFLEG6FYFC 
PPNSRGGSPGGKFGLIEPLPPGLKGIPP 
PPPKRGGEFGPPPPTPAYFFFFLWGGGV 
/ PP / AVGGGGKPP I * GNPP P WPPQG 


1787 


15688 


A 


1798 


2 


383 


SGWLWACRS PDS EPIiACPAGPRQ * GAYH 
PGQQLWMVR*VSQQCWPTPPALHSIS\P 
RP*LPPRRAGLGTCSPP/VPESPIjIiPPW 
APAWPKPPRRALPPA/PPVPGPIDRPRA 
EECRRMVRDWQAAP PAAP 


1788 


15689 


A 


1799 


35 


410 


ATGPSLGKVCS*AFSFL*PKLDFWHLYF 
QASGFLL / CHNS P S QTNSCS S FGKHYGM 
VliRVNSIi* PDHPWNEFYFLNFFI FYRDS 
/SLTILPRItVSNS *AQTILIiPIiP/PKVL 

GLE 


1789 


15690 


A 


1800 


65 


415 


KKGVLKGGPPPQPLGV*GPPPEKGGPGP 
F\GAPKEKPPLALGEPKGKPF*RGPKGF 
FFKKGLEPGAPPPPKPRGEKPPVFKGPH 
PPCKKGGGDKREKNGGIi*RKKPTELGNG 
PPPPG 


1790 


15691 


A 


1801 


417 


3 


NLGPNI FPPRAPQNWGPAPPLFFFLKKN 
FTWGGGSTPLFP /LNLGGLGGPFPGLEV 
YAPPSPHG*PRFFFKNQKLPPPWGALY 
S PFFGGWEN\RKAP PGQTL F FFF FFLK* 
GQDLPMLLQLVPNYWAQVILPPWPPKV 


1791 


15692 


A 


1802 


1 


431 


QPCTPGLK*SSCLSIjPSIGDYRC/RTTV 
PS*FFFFFLEKGVGFIPRGGIKGIjDNC* 
LGPHPPELK\NPPPQNSQEVGTTGPPPR 
PG*LFFFFETFFFF*KGGFFILAQP*iyiK 
WGALKKTPPAFSKRGGYRGGTPMEPTHC 
FLKEN 


1792 


15693 


A 


1803 


256 


399 


AIKNIHDS * EEVKI STLAGI WKKLITTL 
IDDFDGPKTSVEEVTVDVME 


1793 


15694 


A 


1804 


407 


2 


FEKANLPPFLFKNSGPPNVPGRGMAGVP 
KPCCPSKRKVPGP I FFLRRLFLFYPKCR 
RPPLFWPAFGP*KFFLPQVFAPAF* kps 
PKKKRAPVFFPFFFF/RDBCVWLCHPPWS 
AMA*SQLTVTSVSWAQAILLPQR 


1794 


15695 


A 


1805 


429 


116 


IiIiTKK3aRKKKPriACGESGriGGP*GWTG 
V*QAEDTHVIW\VIiSSAPSLSSEEMTDS 
MPGHLPSKDSRYGMEMLTDKKWTWDGGA 
WDSSPQGANGKRGARQASGFS 


1795 


15696 


A 


1806 


2 


408 


F VI FVFL VETGF \ S I LARALACAr SNS * 
PRDLPTLASQGAGITGVSHRTWPRRSCV 
F^EAFLTMLMVLHIPPSLLWHSVFTHLL 
AIjSS * CFLFFLRGPWAVTQAEGQGHDLG 
SLQPPLPGLKWFSCLSLPSS*NYR 


1796 


15697 


A 


1807 


1 


196 


FRLGAS LDLSGGCS GVSWTPAP PLPP P P 
PPSAASGIS/SGSTSAAAGIi*SCRTAFF 
SFLSSFFFFFLKKINPFPFLGGr*PFKG 
GPRLV*GNI*PPPQG\DFLGGTRGGKKP 
WGGGKFGQGGNFPLPLKKPPGPQKNPPP 
\ P P P P P P S AAS G I S MAAHLQL PACDRAA 
QIiSFI/SSHIiFFFFF 


1797 


15698 


A 


1808 


395 


3 


LGKKMKNPKFWQGCEATGTL/M/HCWWV 
CKFVQSFWNTDS I Y*G*AAATHDSAMLL 
LGMHSMPACTFVHQKT*TKMFIAALFIL 
PLNWKQV/RCPSVI\DGYIPTMDQ9TAM 
KiVINKLHAKTWMNLRNLMLNEKPR 


1798 


15699 


A 


1809 


7 


454 


IPGSTISLQPPPPGFG*FFCLSLLSSWD 
RQPAPPR/LAt?LRR*T*LQSA*LP?R\RG 
PILDEMKSFMCNCNSIjTEGVKGRVEMMS 
QNGRLLTKFCHVGQACLKTPDLK*SARL 
GI,LKCWDYSCEPPCIiTQMPPSFPLFP*D 
RVS/PLSPGWS 


1799 


15700 


A 


1810 


20 


355 


pqcahgcravapvcvcfcfcvcvcshcv 
smcmcgevsagwtllclclst\csgaya 
cdwgcg/ cysacvcarvcacvcaqliltc 
igmwe*gqagqgevliidlplciicscwap 

Q 


1800 


15701


A 


1811 


3 


414 


SSKWDlSTNSLQEFiVnDKIAGMKKNLGNLTE 
LNNTV*EFHRAITS INNRINHAEERISD 
LENWLSE I \NRQT* KIVTRNEQKLREW 

DYVKRLNL* I IG\ VFEREGGKAYYLQNI 
FED I VHENFPRFARDANSQ I QEMQRT 


1801 


15702 


A 


1812 


1 


443 


AGKSPSPKKFKKGPGGGRWS-* FLGGAGG 
GVPFRPGIQGSNYRFF/SPPPPPLGEKK 
KPPFKKKKKTPQKLQNGNPHPYFN-PRVH 
GCREQQPWEKAPDATRQPHYADKHVEAG 
EPREPPKPHSEPLFSSPRKRPLFTRSST 
/SGASPP 


1802 


15703 


A 


1813 


411 


66 


VHKNNVFNKEChER * I F/ 1 IQKKKIiDP \ F 
LTRY I KI KS K* I KDLNIRLE I KKTPGKE 
S VTLAKWL I RS P YLS F P * Q I QS KQ * INS 
YVLIKITKEERWSTSKEYQKPWRERKAM 
YKIK 


1803 


15704 


A 


1814 


369 


20 


qevrps i ylsssnrkyvkdsnarftkea 
iqianihmkncptslivgethiktskgy 
hyvpirmakilkdXcnrrcgetgtlihf 
*wewkmvqpfqkpp*wfpkkrkih*qfd 

LATRS 


1804 


15705 


A 


1815 


2 


675 


GLAI LGRRLRGE ACTRS P FS S I L I FMVS 
MGEWPGVP/ GIKGTRTGAEAVPTRRRKS 
SIWPQTGGAGESSG/PLRGLPRQGKPGS 
PRGPSGPPQNCARWWHPQAAPLGACCFS 
GPEPSRLPPWRQGNWCliPSTPSSSA*EG 
WR/PVLQPGFL*SPSSriASICPGAERGP 
PGSSRP /GLRGAPGG 


1805 


15706 


A 


1816 


273 


2 


MESHIMWPFTIGFFHLVQCLHGPSMIiQC 
IAVLHSSYIiFIYLFIY*DRVIiLCHPGWS 
AWQS * LTGASGSWXLRRSFHLSLP* C* 
DFRHEPR 


1806 


15707 


A 


1817 


40 


404 


GLiHE \P6VQGCSEL* SHDCTPAWVTSET 
LTLKKLKIKKSKKKKRANPRGFERPTFG 
EAGPGGLLKAI ISKPRVIQGTQKSGALL 
GFNSNGGENHPPQKQPYLGGLYAKAHS P 
l^GGGPPSPCG 


1807 


15708 


A 


1818


392 


3 


EKYNMSYDIKSTNHRKNCKLDFIKI/RN 
CCSLKDTINKMKMQASNLEKI FAIHMPD 
RGLIFKRKNSCNLVRR*QPPFPKEAKDL 
NTQKSQ*TNG/HGSKETSLIIREMQIKT 
T/MNYTTS I PT*MLKIKKMNX 


1808 


15709 


A 


1819 


311 


431 


EWGRAWWLTPVTPSLWEAEAGESRDQE 
*KLCTTVBK*KTISN/Hr)VPIRSSWTGM 
VAHAC3SrPFTLGGRGG*ITRSGVGDQPD* 
HGEGL 


1809 


15710 


A 


1820 


68 


410 


AKKNQGPGPMVFGFGGIKPPPKQKKVRG 
GFFAICPKEQVFFFFFFWRAWQSLNHC 
SLQPQPPGLiKQFKQSSHLSLLSS^GYKH 
VLPCPANFILFSL>VETGS\SIYFPGWSQ 
TP 


1810 


15711 


A 


1821 


408 


1 


TPFFFLRVLRLTPPLiLGNFFGPGFPPWG 
GFSPGPIiLKGPRP/CFPIFKPIFQPGKW 
GnVFFFFPPFFFYPRGSR*NLKKIFPNF 
PFFFPVFF* ILNPPFFFFFFFPFFFL*D 
RVSLCRPGWSAVARYRLTASSTSQ 


1811 


15712 


A 


1822 


362 


76 


SEIAPLHFNVGDRVSLHLETNKQSNKQT 
KKTL I FRDRVLLCRPGKNA/ VE VQ * * LP 
AASN/FLRIiKQSSCISLSSNWiYRHAPP 
HLATVFNFLIIFE 


1812 


15713 


A 


1823 


314 


2 


VISKPCPRELTCITYGVSLTQCSMFGRM 
KGLLLIWPPVCEVRRASGRPPLMGSEEP 
LCPAATPSGRCTQQ/LH^ERAMMTMAVL 
SNRKGGNVGKR*RNQIVAVS 


1813 


15714 


A 


1824 


57 


389 


NDHLLQIiPTYTDADSTGPTLSGMNVKNIi 
HWSYEYKYRSITGVQWLILGSLQPLPPR 
FKCS CLS FLRRWD YRCAPPRRATF\ * FL 
VETAFLERLTSCDLPTSASQSADITGV 


1814 


15715 


A 


1825 


410 


70 


VPIMSATQDYRHEPPRPAGRFLKKIjKME 
PPHNPALLLVGI * PKNMKSLHKDVCTPM 
FSGTLFAIAKIQKKPNCPS SMDEWINCR 

nyXmhiydgicysalkkneilarrsgtr 

L 


1815 


15716 


A 


1826 


2 


411 


fiivemgfxsmlviaglkiiptsggapasa 
sesagitdvshrawpvpfflkrclvlvg 

RS * *A\ WPHTNItl PPLPSGIKGDLCPNS 

aggwekgapppspgkf/cef*grtgttn 
farg/wtktpd 


1816 


15717 


A 


1827 


276 


3 


grpgpadfrvrpqllqrflfiylfteme 
scsvtqagvqwcniigslqplppglq*fs 
cv\k*fsglgllsswdyrhmpphldnks 
ifsrngvs 


1817 


15718 


A 


1828 


1 


391 


lieprrrflqcvqdcatalqpgqqsktls 
qkkkkkkggpp/s*yqkgqgcpsgkkgr 
gvagkgafgpgfggenktppgggptgeg 
pfpqkgwgpsqgptkgpnlwgpggpkl 

GGKGGP PGP TKGGGGPS S F 


1818


15719 


A 


1829 


2 


134 


DHLSPWWNQPGQHSEAPSIi/IilNIWKIi 
AGCGGAHIiSSHL/ LREDHLNLEVQGCSE 

P * LHACTPAWATE * DPVS QTKLN*NI WK 
LAGCGGAHLSSHLLGRIT 


1819 


15720 


A 


1830 


427 


107 


VQHQPGQQRESPSIIIIQiCLARGGARCL 
*SQLPGKriRLENRFN/SGSERSHQCTPA 
WVTDRDCLKRTGITRASSQRLSVGIKQS 
CLNPRTAQLQVSAQSPSTVSTNL 


1820 


15721 


A 


1831 


3 


540 


VQFPNFKIYYKATAIKTVL\QHKQR*ID 
E/MNKI * TPE INS YVYGYLNFNKDAKAI 
QWGHDSF\FKKWC*DlsrWISACKYSQTSV 
SASSASSSSS 


1821 


15722 


A 


1832 


385 


2 


AGRQSETPYHNSTIMKVSLQVCVDLSCL 
LGPGSLVQDSPNI PAQRKRFLRLGSFWL 
PLSRFSRVGWPIiPHWGQRSSGFSLPR\P 
P*SQIPAPRSPPPAGPVPARSWVCGPRP 

QTRPLPAERPSPRPRRL 


1822 


15723 


A 


1833 


7 


399 


RISRSYLSEYGGSGKEHPTLGASYARIM 
VFG/VFriYF*RQCIA*AQWYSHSSLLP 
QTPGLKHPP \ AQAS * GAGT I GTHHHT * Ii 
TFAF/lFVLGCFFri*NKISVTQAGGQGC 
NFGSLQPPPPGLKRVSCLTLPR 


1823 


15724 


A 


1834 


2 


306 


LARLVSDS*PQ/CDPPASASQSAGIICV 
SHRAQPAEEILKVFGTLCLEPQRPTPDI 
PIIPYLFC\LFEMESCSVAQAGVQWRIL 
GSLQPLAPGVKRVSGLSPP 


1824 


15725 


A 


1835 


12 


400 


KKGMVPKQLKVGKKPPFS CWGPNMKKRD 
SPVFHHQDPIFPIPFFFFGNGFLFFPP/ 
LAGGQGGNLN*PNPLPWGLKEFPPPTPR 
GRGEKG6APPPPINFVFLKKGGFPLGGR 
GGLEPPPLGDPPPLPPKRG 


1825 


15726 


A 


1836 


220 


401 


KGSFVFIPQPEGEGPFLG*LKPRFPGLK 
QFSCLTLLRSGNYGPLPPPPVIF/CGFIi 

R 


1826 


15727 


A 


1837 


12 


357 


GIiLGGQMNGSLGTQTSYEDLMSSS\FKP 
NSPPPTPS/VRTGHLiPK*PLESSNGPPP 
PQVSHSFQGQWARGHPSPPPQWNTPFSP 
PQQYTQCSKTDP*PPPSPPYLGQEGSNA 
PSLiA 


1827 


15728 


A 


1838 


8 


380 


LMKTHAKIiLNKRVAN* IQQYKK/HH/NQ 
MGF I LGVQI YFNF* KINL IQLINS VKKK 
KN/HSSSSSSSSSSSSSSSPPVLIKSLC 
NLGKNKRNFLCLTKGI YKNKTE \NSMKI 
I LNGEQLNAFPLRLGTK 


1828 


15729 


A 


1839 


2 


444 


VPGDAKWFSVLHIiKDAFFFIPLVPBSQY 
PFAPEWENPNTREK\TWAVIjP*GFWDSP 
HFFAQPLERDLRGLQLBDGSILQYVDHL 
LVYSPTQEASDQNTIKTLHFPADRSYKV 
S KKKAQITLQQVHCLGYILTPGTCK/LS 
PERVQAI 


1829 


15730 


A 


1840 


1 


642 


EIKGIQI6KEEVT/SLFADDIDYLRIDS 
TKKLLEVICEIJSrKVG*KINM*T* IVFLY 
IGNEHLEIiEI/R/ELMPCIKTSSTMKYL 
EINLKKDV*DI.YTEWYKVFPREIKIT*A 
NQ*EILCLCSRRIJSrTKM/STFPQVFCVY 
YAI PVK/ 1 PSRNFLVLVDKLILKPI *KY 
RGPRTAKTTLKKKKKWRLTLLI PKSYY 
KTIVITXGWYGFQDRQVD*WITRIS 


1830 


15731 


A 


1841 


3 


423 


HRITSE*DLHLARELIW\SiyiYGSLDHKN 
YDAELDRMAMPCLCAIAGALPPGYVDAS 
SSSKAEKKATVDAEGNFDPRPVETLSVI 
IPEKLDSFINKFAEYTHEKWAFVKIQNN 

WSYGENIDEELKTRPMKR\ PY\ KTYSEK 
D 


1831 


15732 


A 


1842 


33 


179 


YMLGGRGCSDLRSHHCTPAW VT/ TA* LS 
KKQYQRQQNRLQYHLQBYERKK 


1832 


15733 


A 


1843 


349 


10 


LAGGLNSMEGER\LR*EREECTQQQMVH 
DKYCKDLiMGFGTKPRHITPFSS FQkVQP 
QQSNALVGLLGYS SHQGLMGFGASPS PA 
KSTLVESRCCRDLMEEKPDQRK\QWVLK 
CR 


1833 


15734 


A 


1844 


15 


856 


AAEQLSFIYKLPQNPSPSTPGSSLSGTH 
GMQTMLGSTHLPNLTDMLGPELGVQGIP 
SPGCACQRGRG\GGGRECCSPPGVSPQG 
VSAVGRGAEGPGGLTRSGSGAASAIiWP 
GEKGCWCRTASGAGPQRRQRTRGPGSWG 
LSFSQTSEEKCPSPAGSAGAGPVCQRRQ 
SSFAGGCGTGAGAPGST\GDAHPAQGGS 
GGPLRSLPAVGGPRPGPSFLKTSGSGSV 
PQGVP I liWSLT * RAIiAAPGSQGPAGLtAV 
SCTGGRGYRDPQAPGTGAG*HGNSrRLR 
GP 


1834 


15735 


A 


1845 


402 


2 


SKARRQGRPLRQG*APG/AARIPEQKRI 
GGP /EERRRPSARGPRATRVAGEGPKPK 
GQTAMAGGGHDPLPLPPARS RSQE S IGA 
RSRGSGHSQEQPAPQPSGGDPSPPQERN 
LPEGTERPPKIiCSTLPGQGPPPNV 


1835 


15736 


A 


1846 


446 


32 


TSRKI S * KTGKQFLMKECS CNSHHKAAF 
TKKDVLWI \liAVVKHVNTKASETFHFFQ 
SGQAKVQQGFVKEGCELINEALNLFNNV 
YGAMHVKTGTCMRLLDRLQYIMGDYAEA 
LSNQQKAVLM\TERVMGTEHPCIRPL 


1836 


15737 


A 


1847 


440 


4 


VpGRHVEVSKKGGQVNYQAG\KTVEIWA 
DKLGCNMLGTADMVECLKS TRYKBLIQQ 
AITAGGAPIAFGPVIDGNVIPGDRQILM 
SQGEFLNYDIMLGVNQGEGIiKFVDGIVH 
NEDGVTPND*KFSVSNFVDCMRPRRGPN 
YSRFQ 


1837 


15738 


A 


1848 


526 


0 


PRRDPPPKRQTPIPTHVSPWEKGPWGP 
APLRPDHPSLSPCPAMG^K\PGLPRGCP 
QTQISPLFNRSPASPLICHHHPPSEP\K 
PGPEPPPTPSSS IPSLiARFTRPGESSPIi 
PPPQTPSGPP 


1838 


15739 


A 


1849 


417 


31 


QATGQECGCHRGGPPPGPAGETEPQAPL 
RLPGGTGIPWGAGILCPS*I«PGPGSIjSP 
AAGRG/SGPSAGPGAAWFSSP* /PACPS 
SSRSAVPGGAGSFRRAGPGLFYTLPAPP 
WCGRGASNKI IQMPGL.VC 


1839 


15740 


A 


1851 


3 


285 


YTVCECVCVCLCVCLPVSL \SLCLSVS V 
CFFPSLCGFVCVCPCACVSIAECALCAT 
KRFV/ CMAACLW*AS FCVSTWVMRPAVN 
RPRRGGSAIiGA 


1840 


15741 


A 


1852 


128 


524 


KIPGLGQRSEGVGQGKDDLHVTAPVPTH 
6WGEGAASKPTVIiPPPPP\PDAPTVFFF 
PPFFWEKKYFFGPPPKGAGPQIYLLGPN 

ppgfkpffpphpqaagr*yikpppwykc 
pllkkrgvstlapXgfpkppp 


1841 


15742 


A 


1853 


1 


1648 


MTVPLHSGLGKGVKPERKTTVRRPFVSA 
GKKYACPFKKAETPQWRRLRMRPKAPAA 
SLLATFPISQRIilPLTPARKHCPSESET 
STWWSKAPATPRTRGAGPTSPPRPTRR 
R\ *ACSLQKLFAVEEEFEDEDFIiSAVED 
AENRFTGSLPVNAGRLRPVSSRPQETVQ 
AQSSR/PAAVTPHCSLRGPGPARLGP/P 
ASLPPARPVXTA/GPSCIGAAPLRPVST 
SSSWIGNQRRVTVTEVLREPARPQSSAL 
HPLLTFESQQQQVGGFEGPEQDEFDKVL 
ASiVIELEEPGMELECGVSSEAIPIIiPAQQ 
REGSVIiAKKARWDLSGSCQKGPVPAIH 
KAGIMSAQDESLDPVIQCRTP\DPP*DL 
VLWVTFLFQ/PALTVPTQQLHWEVCPQR 
SPVQALQPLQAARGTIQSSPQNRFPCQP 
FQSPSSWLSGKAHLPRPRTPNSSCSTPS 
RTSSGIiFPRXPLQPQAPVSS IGSPVGTP 
KGPQGALQTPIVTNHIiVQBVTAASRTPQ 
QPTHPSTRAKTRRFPGPAGILPHQQSGR 
SLEDIMVSAPQTPTHGALAKFQTE 


1842 


15743 


A 


1854 


235 


223 


ihkfihqiwlakitcqrtkv*kersvll 
ptsfspvpsqghytckqXlcslasdlsq 
pdlwkfmniavlhamwnsrkvscypwt 
miyflhanrt 


1843 


15744 


A 


1855 


373 


3 


IKDGIYRYFYKALDSFCLCCEFMNQFFS 
CMDRELSQRCFLNSAY* FPSNL * CYLC Y 
LFPFF*LKKYFLTFFLFLRDRVLLGCPE 
*SAMVHS *LTVPLDFW\VKGSSCHSLLS 
SWDYRHALPHLY 


1844 


15745 


A 


1856 


378 


1 


RQRHS PAGNTGRPQ VTPCG * I S WPS ITK 
DRTSMSSVTSGALGHTAASPHARLLPLA 
LPSVRTQHGSPPPGQEQPTIICPSNLPT 
HPSLPLGMHPSVRASPPLCK/P/SPPSI 
PASVHASIOIPSPPVY 


1845 


15746 


A 


1857 


3 


379 


YMVRKVIEVWFLLLLLLFFPFRGGFLGQ 
GWGPPAPGFSFGKKNPQGLGSSNLRGPW 
* TNP * PVPGGALFLVGPPTPAEFPFKNF 
SRGFLLV/ALADF*TRPRLVSPHGG/RG 

AKGTPAFLESMPPWMP 


1846 


15747 


A 


1858 


452 


1 


GTHGLLLGSGPF\RQVFKPDNFVFGQSG 
AGNNWAKGHYIEGAKLVDSWDWQEE* 
ESCDCLQGLQLTHSMGSGMGTLFISKIR 
EEGPDCIMNTLSWPPPKVSDTWEPYN 
ITLSIHQSVENTDETYCIDNEALYDICS 
RTLKLTTRCI 


1847 


15748 


A 


1859 


1 


385 


NTSSDYIFPFFFLFRNSIHSVTQAGGQW 
HNQGSLQPWPSRLK\CPTASASICLHMP 
PFtiANFIiIFFVEIGSPYVAQAGSRDPPA 
IiASQ/SAGITGMGHCTQP*VFLFFPFFF 
S FRKKCS P WP PGGGP P I F 


1848 


15749 


A 


1860 


470 


17 


lEMDSRRVPRDKLACITKCSKHIFDAIK 
IT*NELASAD/DFFPTLIYIVLKG/NPP 
CLQYNIQYITRPCNPSRLMTGEDGYYFT 
NLRLGTHCSWIiMMTMWTCFRAPRIPSSQ 
WMSYVLRFRNGITGVSHRAHP 


1849 


15750 


A 


1861 


3 


790 


CSRPEFPGRRFVEAVRSKPYLSLPF*SR 
* S F FNVP AE * T S AKD I LAS S E F I KQNND 
VSS\LQKFMPQEVK*LDHIHTA*ADGSW 

















KAELERGPDGSAPSPFPQKPAASRQPEL 
GELATFLGRVDPWYQSNVWTLCPAlHKLi 
AEMPPSIiDTSRTVDPFILDVlTYYIRMG 
TQP I YFQI YTVKI FFSDLSQDPTED I FL 
lELKVKIQDSKFPKDGFSPRRRGVAEGP 
GAELSLCYQKALLSHRPREVTVSLRATG 
LILKAIPASMY 


1850 


15751 


A 


1862 


192 


3 


SSGSHSVTQA\GVQWHEHSSLIiP/LLT* 
PPGRK*ASHLTLASS*DYRRAPPHPANF 
*IFCREGV 


1851 


15752 


A 


1863 


82 


370 


SLCQKRAFVGEKLVHGLLVSPSGGRVPS 
CPDPWGCRPRFHAI AVYS FLKLRWI PE 
VSILPEDLEELiYDIiFKVRSSGKMRGSRP 
*AGLSRG/DPACP 


1852 


15753 


A 


1864 


2 


325 


IQVYS ISHLS I YLF\ I YHIiSRGSMEiVSM 
SLSI * S I YPCMDV*MYLSNLCMHVYI YL 
FYGS I YRF YLS I CDS VYLS I YLS I YLS I 
YLI CHPS X FKTVI DEHAI FATWRH 


1853 


15754 


A 


1865 


3 


377 


YSPWCKIiFRELCKINVFDNIiDSPLLSGK 
EFNDTTHNTFDHMWIRTKEHNEAGWLLLS 
SVDKVMKENDELRDSNSWLQKQV\* PIjK 
SAKTALSGSLNSCREKAEIVEKQTQSLiT 

M*VADLQRKiyiHVQP 


1854 


15755 


A 


1866 


10 


378 


GWKNGEFIDAL* KVYGHKAPNKSAVYKW 
IT/*FKKGQDDIEDEDHSGRASTLRKKI 
HLVYAIi I DKD / * RLTAE AI ANTID I SIS 
LiAYRILTEKLKLSKLSTQWVPKQLCPDQ 
LQRRAEnpME ILK 


1855 


15756 


A 


1867 


346 


1 


DIIiWKIiQKPQPNGKMLKAAKSRKKFAF 
KGVPVRMNADFS I AAMKA/RRRWNS I FS 
P*KEmCHLRLIiYSAKNIFP/EIKTFSD 
RE FVT I RS AVKE I LKDVLWAEERLSHVK 
SRNV 


1856 


15757 


A 


1868 


1 


377 


GTFFQRTQCKGIKQ\YWGLIIKASSDP 
TCVEKEKVYIGKLNMILAQMLKQEWTEH 
WPAF I SDI VGASRTSKSLCQNNMVIIiKIi 
VSEE VFD FSSGQI TQVKS KHVKDSMCNE 
FSQI*Q\LCQF 


1857 


15758 


A 


1869 


90 


384 


QWLLFTEYSSbYHPVLFFFFFFPGGRTG 
PNP P AGGEGNETG / PNGP S T PGGGGNP P 
PLPPGGLGLLSMPPPPRQILLMETKKRP 
PP*TNKCCSPGYSPP 


1858 


15759 


A 


1870 


2 


578 


PWKHALLLGLDFLPGKMAPWSGQGCSL 
GHTEGGTSWDFAVGGASWRLKWCVKGD 
SHKGPATPIASCKGPLGRPCPLLAQSKA 
*GS *iCRG/VAPGSP*LALGMGGG\DRLT 
LISQVHGNQVTQIIPFSTEGETKAQRSP 
SLPPRDLIRGRHSWNLDSTQLLGYCPLIi 
PPPLHPAGPIiPVPFTNGEIQKENSRE 


1859 


15760 


A 


1871 


1 


382 


SGQDAGSCLLYGAGSGAMVSGAYNPYIE 
IIEQPRQRGMRFIYKCEGRSAGSXPWEH 
STDNNRTYPS IQ IMNYYGKGKVXRITLV 
TKNDPYKPYPHDIiVGKDCRDGYYEA* FG 
QERRPLFFPNLGIRCA 


1860


15761 


A 


1872 


490 


1 


ADSLSKDPGRPLHPPWIWGDRGRQ*PGE 
TPQ *HRRSCASQDPGRSQ * PGKILPSPK 
PGRPPPMTGEDAGPPK\HGNPSPNDQKK 
NPPPK* /PQGNRIHRDPGRPPPQ*PRKK 














SFPNNLGDSNDQRSRFPVA* KNLPQS SS 
GTRAIGRELYIiPPQPGSSAPPNSTT 


1861 


15762 


A 


1873 


373 


1 


GGGVPGALFSHKKKSLFFFPPPPFFFRK 
GL*NF\KIKGCGSPFLPLLFFLEKESCP 
VP\RVECGGVILGPCKVCLPGSPPFSAS 
AS*VSGATGAC/RPRPGKFFSPFFFFYP 
LVEMRFHRVSQDV 


1862 


15763 


A 


1874 


3 


374 


YMLGKE I VS KTKIGQELGLLHQ/TSQW 
NAKEKFIj*ETKSATPiyiNT* IKRKQNSPX 
TETEKALLVWIEDQTSHNILLTQNVI *M 
MALTLFNS I KAERGEEDTEEKLEGSRY W 
FMRFKKKKAISIT 


1863 


15764 


A 


1875 


2 


364 


IHSGKGESLWDLGLDTEFLDLPSKA*Hr 
KAKNDKrjDLIKM*NFCSAK/ET*/lRMK 
TQAIHWERI FVWNI *WRKLVYKELLKIiR 
N\ KiCNTIRR* AIDMDRHFTK/ EKMPMTO 
*YTKGCSISLVIL 


1864 


15765 


A 


1876 


41 


461 


GFLYLCSSEIDPYPSPSKKIKSEWIRTY 
WMKLLSENATIiyiLKDSDLSQV\FCVNIP 
FVQAAKAKIEE^DYIKLKWFCSAKGAI* 
KAKRQPTE/W/DRIFANYPCVYGLITTT 
CMSFTQLTSLITTPTITLPWNPRQSYHH 
NPT 


1865 


15766 


A 


1877 


2 


185 


VRPTKLDPLERTQYTLPLLYKWNNKAWM 
TAHL/FTA*FTEYFKS\SVKPDE*VREI 
DYRMIiYIi 


1866 


15767 


A 


1878 


1 


491 


IHRLLiWPPLSAARPPSRESGLRCRAPRR 
PASAA?\ATAASPS \PTAPQGPPR/RRRL 
LIQPPLYPRGLFTPGVPPL/APGGSREP 
S*SLT*DARTLPLPVLGPRRARLLGACP 
AVQAEEGVDPVGHLSEVLAAPCKQP* TP 
PPACHCLDGEGRPSGVQAPLHKAKLYP 


1867 


15768 


A 


1879 


24 


449 


LQPPLiTWALLLQPPQPKSRAAPFFFFFF 
FGKKSQFLFGPPGGGEGEKLG*REPPPP 
GTKGMPPPPPRERGKKRGGPTGRENLGI 
*RKGGVPPGGRGGGQTPNPGGGGAK\PP 
KGGK*GGGRLPPPQIKGPKRGRPKKRKG 
RP 


1868 


15769 


A 


1880 


190 


2 


PLYCHKVGQVGLELiLTSGDPPPLSLPKC 
WDYRY\DHHAQPSF*LFLSVQISGIKFW 
HSWQPCI 


1869 


15770 


A 


1881 


1 


458 


PAIRAGRNLPDKEFRYLRTVIVTAAVYW 
GLNSKLRCLTS XLnTFQHRAGVSPYTSP 
FGFA*TCVFAKQLLEPILC/RPCFHRAP 
IiLPKLRGHFAE /FP *QCFFR/QALGPS P 
/RSTCVGLRHG 


1870 


15771 


A 


1882 


3 


392 


YMMRYHYIL 1 RMAKVKI \ SSTHDDTQ* L 
KLIHC*QECRMVQPLWKMV*QFLIKLNI 
*LP*NLiAILLWGIYLIEMSTYEFR/RKI 
CIQMFMTDLIVIAKYWTQ/PQCPSVAGW 
IKQ/IRSIHTVEYYSAVKRNQL 


1871 


15772 


A 


1883 


2 


473 


IQGGIAAYRVDRQERSNGRQGWRRNLSG 
TNMRRNKQKGLRRCKPRW/RSRGMAWKV 
DKPRDAR*KSDRDRQ/RGETDG*RVSGG 
LTA* EAYRH\HRGQ\EARJSQSGRSRQAG 
RQTP * QSQAGNE AE IRREEEKSRGQERE 
RE\KDRDADRHKG 


1872 


15773 


A 


1884 


138 


444 


CYIiTLIKCRI I YINDKVIVLTIWY*HK 














DRHID* WNRLQS P * LNPS I WVQL I LLNN 
AKNTQWVKDTLLDKWYW/ENWIYQCMRM 

PLGS YTI PMNQFKMNYGLS 


1873 


15774 


A 


1885 


27 


470 


TICCYIFIjFFSFFSLFFFFFGKGVSF*S 
PGGENTG/A3Sa.G* WNPPPPGKGISPA* P 
PKEPGMEGAGHPPGTKS*PGKFGPGPGG 
GPSFPYRGPNKTWGKPGPKWFPGGrGYP 
ATIPQKKQPEGKRGR\PGGPMSPYNGPP 
KTPTHGKGPG 


1874 


15775 


A 


1886 


478 


1 


KNVQEBTSALQAAMCRKMLI VCQTQCVI I 
SGESGAGKSVAAKYIMGYISKVSGGGEK 
VQVRRGKQKDKLGRPHLHGSAAPPICTP 
ASV*LIjPTPPASAP\HVKD1 ILQSNPLL 
EAFGNAKTVRNNNS SRFVSLCRPAWSS C 
SLRADLSTHTPTHTHTRV 


1875 


15776 


A 


1887 


1 


402 


HSLERPHYIG*LFKNIFSRD/RVFAMLA 
HWSRTPGLKQSTPLSLPKCWDYRCEPQH 

PAGSFFFFFFLKNGFWGGSLGGRAGGQQ 
*rjKS*WRPNPLG*GNPPC*PSKEVGTTG 
AHKKI ANRE IRTRACGGTNFSL 


1876 


15777 


A 


1888 


511 


124 


GTRRQHFAGAHVPPEGP*S\MLDPKLK\ 
DDRPARDHWIREPGIiLiLPRAPAQDAGKY 
YCHRGNLTMSFHLKITARPVLWHCLLRT 
GGWKVSAVTLAYL I FCLCSLVGILHLQR 
GESCPQWVCPNPTPSSPG 


1877 


15778 


A 


1889 


667 


310 


QLKP*ATKSV*KDTAFGIDVGNDFLAMT 
PKAQAMKGKIDKWGFrYR*S ICTAKETI 
NRVKR* PRK*EKI FAKPTWQKGQ I S/RI 
HKEFQQLN/KQKSNNLIERQTKDLNRFL 
SKECSKDL 


1878 


15779 


A 


1890 


462 


3 


KWFFPLGPPFLPPP/PPPI*NPSFQKNK 
KLTRGGCARYFPPLKSPRPRIPFPFFEK 
GEGSPNSKHSPAPPFWGPKETFFFRRTP 
PPPPPPPLCSFLKNNLFSYIPKGFLGGK 
GQNFHSFLPSFFFSIKKKLLGLGVHVRF 
YCRVNS CTGFFVQMY 


1879 


15780 


A 


1891 


1 


455 


NTCLGFGNGFLDATPKA*SMKKI INKLD 
FNETENFCSVKDTVKGMKRQATHWEK/V 
RKTHILYKDLILKIYNQLLKHHNKKTST 
X KQ * AKDLNRED I QMTNKHMKRCSGWGR 
WFTPIIRAIjWACKVEGSLELESSLGNIV 
RPHRKREREIHCY 


1880 


15781 


A 


1892 


1 


537 


rggiqapkevspegrqeparksli *rA* 

ETPP*SQ* /PIPEEP/TOVFMKKPVSVS 
LETGKHAVWPKVNGKELPDNPTIKWFK 
GKWLELGSKSGARPSFKESHNSASNVYP 
VELHIGKWLGDRGYYRLEVKAKDTCDS 
CGFNIDVEAPRQDAYGQSLESFQRTSKR 
SLYAIjAIjEDPGM 


1881 


15782 


A 


1893 


2 


514 


VRCQRRCHE*RACGSSLV3!IAKKLYEDAI. 

markvkqslpsldvetdedkfqmmslqX 

CSLAYGTLTKILSEKRSAKSYGMSSVRM 
RSAGQTSKAHLHQPRRVSQVLQVPAVNL 
liPFRKKGQTKDPALNTSLiPQKVLGTTEE 
I SGKKHTEDTISVASSLHYS PPAS PQGS 


1882 


15783 


A 


1894 


473 


2 


vmgesrgfsppptftggnfvffwkqgvs 
lfcpegfktlcsssppppppqkagvlgg 

SFHARPPPFPFFQI * IPFFFGIKKP\I»C 














S PPP * PL *• KKRGIFPVPKGI FCEMES CS 
VAQAGVQ* YDHGSL.QPWPPGF* * SSYLN 
LV*LSS*DYKCVPPHPMY 


1883 


15784 


A 


1895 


613 


11 


FRPRS PACGLHAVS S PKLPGAQALSS PG 
LNLVFTAGSWDAG/LLRLSPRANNPRVA 
LPRVHTGPSST/DLSPSCPLGLACGFGT 
SQPACIi*SPPLLLAPARPS6CAWPGIiPP 
VCSLHLRGGWGGADPTGCLDAWAMAXiAS 
IiRPCLCPALiPLSREPPSPIiLNDLVLPRM 
PRRAPVLPLGRTPHSLLLCHVQS PSEQL 
PSKGPERL 


1884 


15785 


A 


1896 


20 


449 


KFGYS SAAARRQQLGWEAWL* YS FPLQL 
EPSAQTWGPGTLRLPNRALLVIirVKFBGS 
EVSPCVACGIQAAIiSMGSTSSVKIiljSHP 
QAPLPQWHQMVFARCLCMCGAQLNVPP \ 
ESFTFQVSTKDVPIiALMACALRK/KATV 
FRQPL 


1885 


15786 


A 


1897 


393 


3 


RPTAQSKGNI*VRVAS \EALS PKLLDFL 
PGKVTiNGEKVDVRPATRQNLSQFEAQAR 
KRECVRVPRGGIPPRAHSRDSSDSADGR 
ATPSENLVPSSARVDKPPSVLPYFNRPP 

SAIiPVMGLPPPPIPPPPCI 


1886 


15787 


A 


1898 


395 


217 


RER/ CKS FR/ PPA/HLQAKI KGAQ * QVN 
QAAAAQAAAPAAAMVSRDISSLLVSSQK 

SKVSNYM 


1887 


15788 


A 


1899 


1 


375 


NTVLVQ*NNKAWMTVHPF/TAWFSEYFK 
ATVE I YCS / EKI P FKI LLVFARVHSHPR 
TLiI E I * KE I YAVF 1 PAMTPS I LQ PMDHG 
I ILSSKP YYLRKASRAQRLTPVI PALWE 
ASAAGS PEVGSSGLA 


1888 


15789 


A 


1900 


47 


326 


VKSIIQFNSKNEPGKHDKTPSPEKTQIV 
WWLGSELRSPSYSGAAGGAEMGRSLESW 
RSRPQRAETAPLHSSPGSGSEMLSL*RQ 
HLTPTAWAGVQWRGLCSLRPRPPGFK* / 
FCPSQHHPQPPSSWDYATRCQATKQFVF 
FIiEMGF\VMFARLILTVELim 


1889 


15790 


A 


1901 


181 


837 


AGRVDRREPGMGTGI CKELETGSRETRS 
ASRWGRGRWRLGQACRVPQGLPLSTFHL 
GAQAKARGGTPIiACS SHLPNSHVGSLKA 
QRDEAMVQSGIiAPAVSSTCTRWT*GSEW 
* *GLHIVAARRQQGREEEPPRTTAAPET 
liCFQQTSWASSCSLEHSAQPSEVQVRAIi 
SVPSHSPMW\PQLSLPRDHRKPPGE 


1890 


15791 


A 


1902 


1 


385 


YTWGFRGKKPLIHCL*EYKLVQPLWRAA 
WRFSK* LRVEL* FNAAI S PLGVYPE ENK 
LFYQNSTCTCWFITALFITGKT/WNQP 


1891 


15792 


A 


1903 


207 


3 


FREMEFLHLGQAGtiELSTSGDPPTliASQ 
lSrVG/HYRREPLRPANTLQS*PLGLKQPS 
CLSLPSSWEYRHMY 


1892 


15793 


A 


1904 


2 


391 


IQPLISQRKYKTLGQNSLS/CCAHP/PP 
HFFFVLDS YFHSLLSQLKNAPFT * G* CP 
YWYSKSYSLIHSWOSSSILPCPIiTSSGF 
PSIjPPSYQPtjPCPSIiFLL*NSLSTLCLL 
FFSLLSQPSFSNRWPSQVYL 


1893 


15794 


A 


1905 


3 


424 


YRAGCLQSLIjPPPrjLIiFLIiPCDVPFPPS 
S\LP*VKASCGLIRSQQNVGTIPCLQNR 
KSNKPLHKLPSLRHSLRAMQimiLPILiR . 
IGKPF/ IFFS *DGVSLCHPGWSAGVQIiQ 














FTALHLL/VLKRFSCLSL 


1894 


15795 


A 


1906 


2 


389 


IQRGLDKSCLITDNIPSSQSLIQKKGVU 
PSFKSMKADRGKEAAEEKSEASRSWFMR 
EKERSHLHN/ IK*VQGKAACYPEDIiAS I 
IDEGGYTKQ*IFNEDy/MWKKM*'FRSFIj 
TREKI1TPGFKASKDRL.TPLV 


1895 


15796 


A 


1907 


458 


13 


AKEETQSVWDFPM/VPQGVYVNFPVSR 
NANLSTIKQLLWHRAQYSPLFHMLSGPE 
AYVFTCINQTAEQQEIiEDEQRRtiCDVQP 
FLPWRMVDCEGARVNKLLSSQISLFIG 
KGVRELDSLSDPEVSDFSTKMCQFCEKS 
AAL*DQATS 


1896 


15797 


A 


1908 


409 


3 


BKTDGLYRASQRGKDIiRRLCAQSFHASW 
KDGMALCALIHRRRQXDLIGYAKXiRKDD 
P IGNLNTAFE VAEKYLD I PKMLDAED IV 
TTPKRDEKAIMTYVSWIj*IVIAGAEQAE 
TAASRI CKVIiAVlSrREKKKLMEERV 


1897 


15798 


A 


1909 


116 


379 


HSGPRREGALIiLPKCLPHAKRCLLLFKM 
CSDGATALCCPGWS SAAPS * LTQSP /AS 
TSQAK*PSHLGLPSCWDYRCIPPHPANC 
LDYYY 


1898 


15799 


A 


1910 


418 


3 


qd*yatanrwficmiisqacflpsl*pah 
ril*l*qlmlfsfpgtpvfsygdeiglda 
s alpgqhmeap vmlwdes s fpd 1 pgavs 
almivkgqseapgsllslfrrXlsvqrs 
kersllhgdflafsagpklfsyirpmy 


1899 


15800 


A 


1911 


394 


3 


ileaypevkdpavkgasskkemygh* / a 
aeqalpvaseqeqqrhersekkqpqvke 
gnntnksekiqlsenicdstssaaagrl 
tqqrkigktypqqfpkklkeehdrctlk 
qeneektnvnims kknredv 


1900 


15801 


A 


1912 


499 


141 


pglgerdwtskygqgagegstrewasrc 
G/ 1 rpggdagqqq p rpe * s vc p rgahs p 
gpgswkaspawhsaepggrcsglgvqke 
gfpghllqpgcrtpgpgirkerfsgylq 


1901 


15802 


A 


1913 


127 


387 


I S FVF P P TL P KMPQLiKPET I SMTGLNL F 

qhlcnlarlatsaydgcsnsev/cdldl 
ly^aalplkldyqirfpnyfstkyriyy 

LCLY 


1902 


15803 


A 


1914 


504 


0 


PGPGQRKHSTAPMCLLNIYAQTLRFVIoA 
NQIYKCIKRIIHHDQVGFIPIMQSWFNI 
QNQ*\INLIHHINRLN*KNHMIISFDKT 
HHLFIIKTFIKLGIEGNLLNIiIKNIF/S 
KITPAANIILNSEKV 


1903 


15804 


A 


1915 


46 


415 


YTSNKQLQIQILKITYNSTKKYKILINW 
IKDYKDQYTBNQKILLREIKDLNK/YRD 
TSCSW/NIVQMSMFS/KLIYRFSEPPNK 
NLSTL* riiTTSF*FAWKYTGTGIYKVTM 
KNRVGRLSLPNFQNY 


1904 


15805 


A 


1916 


420 


1 


ENADCVERARKSPDSIP* \RGGQISVTM 
VSPNEQEKAGQIiAIGVRAVRYNGV\LLA 
KMWRKKLHLTLSANLEKIIAIGLPFSNP 
ERKPPENTFLKLTAMATHSESNLSCFAQ 
EDXAI CRPHPAIKMPEIOIEQ YKPLTASV 


1905 


15806 


A 


1917 


384 


3 


TRTITSGQYSTHVIRASRVPTRS * VPVF 
RSCTSNRRFSQAIEPRVMH*KVHIRAST 
VRYDSGGHVAVYPADTDSAI^VYQLGKILG 
ANLYVVMSLNWLDEESNKKHPFPCHTS / 














RRTALT*YLDlTHPPMy 


1906 


15807 


A 


1918 


405 


105 


KAEAQRRREICSGPHSQADSAWHPLVTW 
WH* PPPVARSSQEPGVHS PVP /HVCRKI 

liLIRPKMAIxANEGMYRELRWFTPWSRSR 
*VGCLTTPGMCVKHLRCV 


1907 


15808 


A 


1919 


409 


3 


GGQEBGVFRVLWSKEFETERGKP* IPPV 
KDQEIXH\PTKFHHVAHMGPGDGKQVLM 
DLPLSAVPPSQEESPGPAPTNLARQPPC 
RNKPYISWPSSGGSEPSVTVPIiRSMSDP 
AQDFDKEPDSDSTKHSTPSNSSNP 


1908 


15809 


A 


1920 


9 


470 


APARNPLPRPCTWPTGP*CLRPARPPVA 
S ACLCGGTWNLALWLCS PGT / P VPFLTP 
PCS / S CEVQQPASHS VASNQSKEPAKSA 
AVAHECPPGGTGSADPGWPPGATCPESP 
GPATPHTLGWEPGKSSPPTMEEEPWAP 
QGS PCWTVRQRMTMMM 


1909 


15810 


A 


1921 


556 


1 


IQLWAALGGILRRVGRDPPSHRIGKEPS 
AMAGQAGGNGDGEEGSSXGGLAAVPHLP 
* SQHAAHPLI,IGPPGQQNLGDSKV*GFP 
S PRLEENTLENGGWGS KQLHGS PGS QHA 
GGSWKNGETSIiKG/PH*ADGAGRHTMPQ 
SPPSPPFKPHSV*HNPPAS/PPPHGSPA 
PSGTSPLPMSAVSLLPPGSL 


1910 


15811 


A 


1922 


567 


41 


GGWGETFSRLGNDLQAHR* SRFNAQAQE 
ETSR\VLAVSLINEALDKGSLEKTLSAL 
LLPAAGLDDVSLPVAPRYHLLLVAAKRQ 
KAQVTGDPGAVLWLEEIRQGWRATQDA 
NTAQRMALGVAAINQAIKEGKAAQTERV 
LRNAAVALRGWPDCANGYQRDLESAMA 
KIQRPAAV 


1911 


15812 


A 


1923 


2 


405 


IQGCGlTSSSVhKGMVFKKETEGD/VTS 
VKDAKIAEYSCPFDGMITETKGTVLIKT 
DEELMNLSKGEENLMDA*VKAIADTGAN 
WVTGGKVADMALHYANKYNMMLVKLNS 
QWDVRRLCKTVGATALPKLTPPCL 


1912 


15813 


A 


1924 


510 


37 


LLGHAFHVQSSSGRPQLAEASGHSHLKK 
GECVQQRTGNVGLSPNTARWGTPLGPS I 
SSSAPPWSFSAFPGPPGSGK*AAKDGPG 
CSLKSGLKR\RSQGGLR*TRGSGPSPMP 
PS PS PS /ERPPPGDEGLLPGTPRGGLPG 
PKINTACVCAADI S PGLBP V 


1913 


15814 


A 


1925 


74 


429 


ATIPGHELLLLFFFFFFFFFLGKGAWPIi 
G*GKG\GGPIRG*GNRAPRG*REFPPPT 
PGKRGNTGGGQPGQPIFGFLKKKGAPPG 
GPGVPKTRGQIEPPPWPSKRAGVTGWTL 
GPGPKV 


1914 


15815 


A 


1926 


515 


304 


ALAASIjALALNGVFTNTIK\* XVGRPRP 
DFFYRCFP/GMG* PHSDLMCTGDKD\ W 
NEGPKETSPSGHSSPV 


1915 


15816


A 


1927 


1


433 


nwgsnkkskklyfaeieksylitfiwnl 
kgpq*vkiilkkkssvtkctiiihska*y 
kv/lki vwywqrdthm/dhwns i kqhmv 
kqffdgdtktvqwakdclynnwcwbkXw 

ISTSNRMKFNIYPTP*AtsriNCHQDPNSK 
ELYRYKN 


1916 


15817 


A 


1928 


316 


356 


GGT*PPQAATPIS*LYLPITSSIj/TLPP 
APSLPLQIAPIPSDSPSPRYS/TPTPPI 
SLGLAPPTTLLIPVPSLPVSPRPIiNSTA 














PP/TFLTGSGAAVTWSLAVSLSPAPAS 
AMDKAQWM 


1917 


15818 


A 


1929 


490 


11 


PTRLTCPGSHVPGPVTFPPYETHSMP \ P 
PCHVHPLVQPPGSSHVPGPVTSSPSPPS 
ATHIHPLVQPPGSSQIiPNTWVHPSQRMR 
TWLQQHPPIiPAPPQP/ PSPGS I CSSMTH 
LTEYLASFPAPQ/PREQAPQGGVPTSLP 
QW*APQPTAPCRATQSQQLLDG 


1918 


15819 


A 


1930 


1 


552 


RNPRKRAFPSPPIPAPPS*NQENERSRH 
PQSLL\PFVKSRKRAFPSP/LSPCSSFV 
KSRKRAFPSPPVPAPPS*NQGNERSRHP 
QSLLLLREIKEMSVPVTPSPCSSFVKSR 
K*AFPSPPVPVLLRDHTFLPLPEPRQPS 
TVPVGCFGFSRIPRRWNHTGCAFLrilCH 
SARLLCDPRVSSCWRHS 


1919 


15820 


A 


1931 


414 


3 


RVPGESRK* ERVLDSRHKD * EGRRRGNI 
\MGKRVDYTT*TVITDVTNLS IDQVGVH 
RSIAANMTFAE I VTPFNIDRLQELVRRG 
NSQYPGAKYIIRDNGDRIDLRFHPKHSD 
LHLQTGYKVKRHMCDGDIVIPNRQCI 


1920 


15821 


A 


1932 


521 


103 


ArEPAGVRLKEGGNITESFVTVGNVISA 
IiADFSQDA/SRYS * KKKQVLVPYRDSVM 
TWLLKDSLGGNSKAIMMATI S PADVNYG 
ETLSTLRYANRAKNI INKPTINEDANVK 
LIRELRAE I ARLKTLLAQGNQIALLDS P 


1921 


15822 


A 


1933 


1 


490 


NTGWWRKGEGNGTLMVRRLPCVPLCS S 
AGGLTEEDEGELCAAGFPLDAEDFGQAL 
QQLQTAHS QAVGAP / KGGDQ VLGGGTGA 
PPTAYPNHALSIS* IPSVSWHDEGALPA 
VKIEILGIIHLLPERHALLSLVQARSGL 
LLHGPPATGKILLNKAGTTECCLT 


1922 


15823 


A 


1934 


383 


151 


EVAFLENLiIKDDIERGRLPIiLLVANAGT 
AAVGHS \ S * LGRMKELCVQCVNRPKMEG 
YMHVSQHPVPESHKMRAKAI F 


1923 


15824 


A 


1935 


379 


1 


Y VPWS KDKE YF FNS Q * GLTTHQ I LPY I 
DGFRHVQKI SAEADVKLNL VR I AI Q ILL 
*VGLQSYLGQGHQPGKS/CRGPGCEGWE 
GMVLRS * AQLSLSGTTAL*HWCPSSRKL 
IQFGIIKNLIRRLCI 


1924 


15825 


A 


1936 


376 


2 


GCLFIYMKPTAMSSSQVARSGEVSPFTA 
VPA* S * K/QGHGAVLGCHITSEACFQAC 
FPFLRPGRSTCFA*SGC*DVERSSSHSH 

GTAHSP\HGTAHTPMEQHTHSHEQHTHS 
HEQHTHSHGTAHMY 


1925 


15826 


A 


1937 


426 


1 


KHGE I IDDLLKVYRDNASKKSAI / YSKW 
XTHWKKRRDDVGEEVHSSRPATSVCEBT 
IHLVCALr*ED*LIAAETIANAVGIPRC 
SAYTILT/E /KLKLSKLSTRWVPK/P/L 
LPAQLQIREKRSMAIIiWKWNQDHEAFLH 
IIIAGLY 


1926 


15827 


A 


1938 


2 


469 


KRRKLVSSIAAAIHPALSS*TRQRFLLL 
LFLFHIVREDLVQLRIKKKHTIQIGNVE 
VKLL*PTDNIILCGKLYRFHRSTQKVXiQ 
L I /NFS I VAAYKINMHISLVFLYSNDE* 
LENKTKQTSSFTIALKRIKYVGSNKKSK 
KFYFAEIEKSYLDFIWN 


1927 


15828 


A 


1939 


468 


920 


I PLSTHS PGMGTSTHLS TEIHHLS F WTYH 
ERLLNPGRETDWA/ IMLFRWGLLMLPRL 














VSNS *AQVIPPTSASQSAGI TGMS \HAS 
GHLIFNY\CSM*WLHLRNSQ*KRCTGQE 
TGPCPVTQDGVQWSNYGSL* PQTPRLKQ 
SYHVSIiPSSWDYS 


1928 


15829 


A 


1940 


448 


474 


GtPGG*GSTTRNALYHVMNGEDWXLTT 
CKHGKDWQKHKDSRCDRDNTEYEK\YDF 
GEMLHNATFCLVHRGRRHGSFRFLSAMQ 
AACVPVMIjSNGWELPFSEVINWSQAAVI 
GDERLLIiQIPSTIRSIHRDKIIALRQRS 
QFLWESCIATAIEDPMLIGWR 


1929 


15830 


A 


1941 


1 


423 


NTTLIPAGGM*3SrVCPGPLCARLLHRSLT 
HATPDPPLLTLP*IPTPTPSSRCAPKAP 
PHLICPTSCPLCSKPHPWPP/CSSCQ 


1930 


15831 


A 


1942 


48 


417 


RLTMYQVL YRLHCTAIiQPLPPRVK* FS C 
lslpstwnymhmpqcptnfX CIFTRDEV 

PPMLP\SWWRTP 


1931 


15832 


A 


1943 


450 


2 


VHRRSNRQNMDTSR*RGAMQSVESV\GV 
PY\EQ*TIVDGI\NSGVWEGIAYA*IEE 
RYPEEFALRDQEKYLYRYPGGE / AYKVE 
T I KLNVE AVNTRRD KPXNI FAKNQAP VR 
MRRNS FTPLSSSNT IRRPRNYS VGSRP I 
KPLSPLRAQECI 


1932 


15833 


A 


1944 


451 


3 


GRITRHLPRRAEDDREREREPSPLPSRH 
PMFPPSVTPKASSDWP/PASSIPCQACH 
G / PPP VS L PRKPAHRS C\ P VFP P \ ASGG 
DTSIHSGKTVYVKRKSQPAWPVLPPGGL 
*APSHGAPSPSPPDQHRHCPEI\R*DLL 
PAPAPSPFSIPPLY 


1933 


15834 


A 


1945 


402 


3 


VRL,LVSWKVMYP*SRSKASL/HPQITAS 
LTGSCVNCIVILILNFFYEKISAWIAKM 
E I PRT YQE YESSLTLKMFLFQFVNL YS S 

wfyvaffkgkfvgypgkytylfnewrse 
ecdpggclieltnqmtiimagdv 


1934 


15835 


A 


1947 


1 


405 


ntgwrvfailcslkgrprgierv/ggkk 
kkkkkpggplgpaqgktpktqkgggaqg 
nrgkpplfsggnfgngrifgnplpapgp 

GGGGGVR * KTR\ KRKNGNWDLLKGGGKL 
VFCPMVGKKLSGVPGV* QKNFKGGW 


1935 


15836 


A 


1948 


443 


1 


ltpgaanasrilg\ccmedlsvwg*rqgl 
weallthnmvagcrleevdnayghyeaf 
stlapkawifsvelaepcvpepgiippvfa 
nfiqllsa/pwvteggtawlewwhvqp 

MLALMSAELRKSQVLNRVT * GAHYSDC I 
AAALRIKIT 


1936 


15837 


A 


1949 


396 


2 


GNGRGGSVPPNSLNEDGISCAI*/RHIN 
WLNG*TPTIYCL*ETHLICK/DH/HRLR 
VKGKKKILHPNGNQKPAGVALLT*GQTD 
IK* KAI KSNKEGHYE IKGS VKQENI TVG 
JSTIYAPSTRAPRYPCKQLLDLKGVN 


1937 


15838 


A 


1950 


419 


1 


RWQ/PSARPPTPSGK*GASLPARPSSGT 
*GAIiLPGCPVWKVRSASSWPPSRLGSEE 
PLCPA\PSHIiGSEER\PSRPPSHIGSKE 
R\PARPPIA*DVGSASAPP/LPSGM*ER 
PAAPPSGR*6ASLPGRPSPEMWGVPLPR 
RPVWDVRAPSRAPWEVRSVSAWPPHLR 
REKTLRLATAPV 


1938 


15839 


A 


1951 


422 


3 


QNHVIMS E / DAKI AFGKI QYP FKME S LN 
TL * MKVNFLNLIKI TQiOiTP lANTMFTGE 














mle/afli.rqgkrsc/csijSpilfi ill 
evlps * vwqek* / iqg* eree *ncdfvd 
dmi vyven* kkplynll* i is * varfkv 

ILDTCI 


1939 


15840 


A 


1952 


2 


409 


CAHYSKNPPQRPLRTDARLPAV/DFWSL 
GAILFMVGCGQPPPQEAtTDSETLTMIMD 
CKYTEPSHVSKEGKDLITRTLQIDPRRR 
ASLQ* lENHPWLKGVDPSPATKNNIPLV 
* YK\NLSEEEHNS I IHRiVlALGDITDR 


1940 


15841 


A 


1953 


2 


367 


IQCVRL*VHVCLCASSCVCICLYVHATL 
CVSTCL*CVAVCM/CLLCACATVSACLC 
/V/CGCVSTCVCVCPPVCTCDCVCMSAC 
VCT/ CLCVCVTVSTCVCERLCVPTCVCR 
CPCI CSSLNGNEWMG 


1941 


15842 


A 


1954 


374 


2 


EAPWLLRAGRGALPPCWSKTPPSPLIiFX 
PPLLPGTLVYQPWVPLPPWNRIiACAVTL 
ST*ARAGTSNPSWHLPPVSTAPQHPSTW 
QSPGAGTMGDPSLS PWTSHLLPGCCHNP 
CHCPFPHHKPTRV 


1942 


15843 


A 


1955 


1 


411 


NTPSPELHPC*PGL\PPLPSPPQQPPTW 
APPRTSTQQKLPIIiCLLKPSAHTDAPCT 
QPGSTLPLHTPHTQQAQGTAYQIHTT*A 
AP P PGAKPG * RCP P P PP SRQ PQRMQTPA 
PGNPQ P C PRIiTTLTRVLVP AVP PLP I 


1943 


15844 


A 


1956 


33 


451 


RGRNTFGPLQSPPPRFK*FSCIiSLLRSW 
E E \ RD YRCMP VHPANTMLAS L VLNS \ CD 
LPAIiASQSPGITGVSHPTRPHIiS^FLMP 
LE*GHHHLLLVLLSQPPNMPP * FQRFST 
GQLECSSFRTNLIVPSPSLNFHSDFPLT 
S 


1944 


15845 


A 


1957 


3 


399 


YMQVRTTMSDHSSHSISKINTDNTKSCK 
G*GSTESLFHDRWEYKFVQLLiWETVWHY 
VR*TFTILYNPEIILKR/IFRHTYKNVC 


1945 


15846 


A 


1958 


47 


399 


AANPTLPAVFFFFFLETEPPFGPPGGRA 
GPQSRLTEPPPSGVKPIPR/PPPPGEPE 
KWPD\QGGGGGGDPKPPGGPPP * ATGQN 
SAGKKKKKKI PRPGAHGADPLLPGKAGG 
EEWLDPA 


1946 


15847 


A 


1959 


407 


238 


TQAFALI / KDGGVIGGICFRMFRTQG IR 
EIVF*AVPSNEQVKVSGSPRRQPQCTAH 
S 


1947 


15848 


A 


1960 


106 


1854 


NEAVKSKPNQTRNHKEKNNYVHLPDAGP 
SQPASAGGSSSACRRSTKATLHKRWVSS 
PAGPGVQPLSWQHPPMMA*GLQGNPSPQ 
AAAPPARPGLVSGNCLT*EMAQAGA\GT 
GGSLSASLGKRCPHrPVPCE\PVL*GLC 
VPGRSSLGELGGNPVTVQSFPGPGAESD 
PW*GDAECCSLLQASFGDRAGWSIR/RG 
S VGRPAGVPRGKGRKPTLSGA/ SGPGS V 
LGGFCCPEPLSREAESGMVQE* *GRFWT 
GQBRTPTGR*GCKVAGYLHSSATVGHSG 
AGAGGSPGKTSATLDVGQGLGQT/PSGP 
AWDRTNGYMEKAALPLCNGKVTGNTQCG 
AQPPASGPSPWVWQPLLLRQRLRQSTGL 
* riLFP6VGGGI.QPGEG\GQPFPLSPWGS 
IiTGRiCNrSNKARPGGKFEGTEG*GARNGAS 
LTWVWTVPNGGYCPQAGRDFWLVDSRFK 
PSLGLRAMV*NSRKAPLSFEDGRMGTV* 














PQGQEGRASHRKGDSGLSPHSGKG*DP* 
G/PGGKERSISPPLPRGL*GP /VGERGR 
NGDWPW 


1948 


15849 


A 


1961 


76 


427 


QRQEPTPGHTTSGTRHPPGPGGRGPIQT 
HSHSHLFPWLETSAGSSPPNFLCSLPPY 
TPLLRIiLPSPFLCPSQTTQRMPlPAPHP 
SPASPCKVP*DARGAPAGGAPSTri/RGH 
HPPHP 


1949 


15850 


A 


1962 


375 


131 


FliVGRIRIjriyCLREIiFVYI/LREQKAKRN 
SQWVHTLPISSRHVDAVPCSPS INRNRM 
GRDKKRP FPL\ CVDA*LHS PSWIQP 


1950 


15851 


A 


1963 


89 


538 


GLILLQKWHPGAVS SMVERG* LLHSGLF 
FSFPDKVSSWQAGVQWHDLSSLQMAS 
GSKRFS SIiGDHRHAPPHIAHFC I P/M* T 
QDFAMIiARIiGTSFIYLFGCCC\PETGSH 
S VAQAE VQWHNHS SLQPHS PRLRRSFHL 
SIPGSWDHRNAP 


1951 


15852 


A 


1964 


402 


42 


CCHYPGTPGLQGRIiSASHQAS / SGPQLP 
APLPLQVLPFLALGIGVDDVFLLAHAFT 
EALPGTPLQVGPCP\QGSSEAAQLTG*E 
PLGSSDIiGLLMNLGASCPHL* TGEI IVL 
CPKGYCLDQ 


1952 


15853 


A 


1965 


8 


14 


SSRAAACPPRPALSLPRPPVFTRRMGPQ 
GSGIiP KLPVSAP S SRMGS F PAP S PLLIA 
SGDRHLCPDAAALPLLAIETGCDS\PPG 
H*PPPSPGGLHPCSPPS*PLIL 


1953 


15854 


A 


1966 


521 


2 


YELyGIIKRHRREVSNNQQVRYKSHSIP 
PDYVSSVPTDPTWGPERREEESSGHFMV 
DHTGTAAGGGGGMIIxASPKLGATPIiPPE 
*APA/PPPPPPPPPPPGVGSGHLNIPLI 
LEELRVLQQRQ IHQMQMTEQ I CRQVLLL 
GSLGQTVGAPASPSEIHGTGTASSTKPL 
LPLTV 


1954 


15855 


A 


1968 


111 


395 


YLMRGFLLHHNMVKGKEDERKTKGARLI 
LFFFFFPKKESPSPPRVEGRDP\NIiGT* 
NPLPPKVKKLSRPTPPEKWEPGAVPHCP 
NNFLFFRKNGGS 


1955 


15856 


A 


1969 


413 


101 


D VNRHFS KKNiGICQ * L VI KEMQ I KTBCRR 
YHFS PTRMGKIKN\ KKENKFWQGYGETG 
I FTHCWWE CKMVQ P I KKTVWQFFIiKVK* 
LI4VFGPACKELGSYYSLVTKS 


1956 


15857 


A 


1970 


411 


1 


NLiTPAIiCTKVHFKWVMDIKMKGKTIRLL 
ENNTRKYI,HN"IKIWKGLNRTE\KP*TIK 
KKTDKFDYDTIKNLSSSKNKTRQTIDWE 
NXli\YIiQYXTDK* I ISGIHKAFLKV/ YK 
/ KKTNNP VGKWARDUTGHLI KND IHMAC 
I 


1957 


15858 


A 


1971 


3 


828 


gqachf i frs aqaggsrlri wrakqlgl 
wrvcalimpalehsfptrliiqgnsvpgp 
s i sldts cl/ cnrcvweggrg* t6pglp 
slgkvlvegipsespgptashpccsprp 
dpdq/lsc i s aps atptvh* shltlpmg 
wpgpq*glhlsdqpgprkp*pgcsgiigg 
gdaaprgmekpXpppqlp 


1958 


15859 


A 


1972 


398 


3 


GPGCYFSVRLQCP/RKIPAWKRAVCSAT 
LI \SIiQGPSLSAPHVLGIiAAIiAVHLGES 
RSALPEVDVGPPAPGAGLPVPALFDSLL 
TCRTRDSIiFFCLK*ALLPQSRLLSSIiSK 
SEQ ID NO: of nucleotide sequence


SEQ ID NO: of peptide sequence


Method


SEQ ID NO: in USSN 09/515,126


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence


Amino acid sequence (A=Alanine,
C=Cysteine, D=Aspartic Acid,
E=Glutamic Acid, F=Phenylalanine,
G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine,
N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop codon, /=possible
nucleotide deletion, \=possible
nucleotide insertion)
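Each table row consists of the seven cells named in the column headings, in that order. The sketch below shows one way to hold a row in a typed record and to derive the nucleotide span covered by the predicted coordinates. The names `Row`, `parse_row` and `nucleotide_span` are hypothetical, and the sketch assumes that a row has already been split into its cells and that both coordinates are inclusive.

```python
from dataclasses import dataclass


@dataclass
class Row:
    nucleotide_seq_id: int  # SEQ ID NO: of nucleotide sequence
    peptide_seq_id: int     # SEQ ID NO: of peptide sequence
    method: str             # Method
    ussn_seq_id: int        # SEQ ID NO: in USSN 09/515,126
    begin_nt: int           # predicted beginning nucleotide location
    end_nt: int             # predicted end nucleotide location
    sequence: str           # amino acid sequence, whitespace removed


def parse_row(cells: list[str]) -> Row:
    """Build a Row from six scalar cells followed by the sequence lines."""
    nuc, pep, method, ussn, begin, end, *seq_lines = [c.strip() for c in cells]
    # Sequence cells wrap across lines; join them and drop all spacing.
    sequence = "".join("".join(seq_lines).split())
    return Row(int(nuc), int(pep), method, int(ussn),
               int(begin), int(end), sequence)


def nucleotide_span(row: Row) -> int:
    """Number of nucleotides between the two locations (assumed inclusive)."""
    return abs(row.end_nt - row.begin_nt) + 1


if __name__ == "__main__":
    # Scalar cells of the row for peptide SEQ ID NO: 15860, with a
    # shortened sequence for illustration.
    row = parse_row(["1959", "15860", "A", "1973", "2", "433",
                     "QDPTKMTKEE", "LNGGS"])
    print(row.peptide_seq_id, nucleotide_span(row))
```

The absolute difference is used because the beginning location is larger than the end location in many rows; the sketch does not interpret what that ordering means.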














SDTCSFDiCIRRDRHADIPPLY 


1959 


15860 


A 


1973 


2 


433 


QDPTKMTKEEI.NALKSTGDGTLG/RASE 
VEVKNEIVANVGKREILHNTEKEQHTED 
TVKDCVDIEVFTAGENTEDQKSSEDTAP 
FLGTIiAGATYEEQVQSQILESASLPENT 
AQVESWEVMGAPDDRTRTPLEPSNCWSD 
LNGGS 


1960 


15861 


A 


1974 


373 


580 


TIFSRQVLRIQNAIiSDKPNVSTVYSNNG 
SELHGTSEASV/YHWKILISTEYHKTL 


1961 


15862 


A 


1975 


47 


413 


KWKHLRGSEHWRPQENQVHQRIAELRKA 
GLWSQRR/PAEAAGGPRPKSHWDYUCiEE 
MQWMATDFAQERWKVASVKKMVRAVARQ 
LQDRTRREAGARREEPSRLRQTS PVLPE 
KSSVPGLVIiRR 


1962 


15863 


A 


1976 


444 


3 


GYERSRGTTSGTHSSRDYDGIFGQATVK 
WAMLDQ FRMIiS PCS KEVMHQPFYLKRVE 
IMAQCEEWIADIQQYSSDKRVGRIMSRH 
SAAIKRRTGQLREELLKLPCPEGIiDPDS 
GDAPEVCRAATGAEETIMHDQV\QPSSS 
KVLPSDFQ 


1963 


15864 


A 


1977 


281 


567 


PRSSLQGQELSRPWGRKGNFPGPPVPPP 
QRMFYDSELFSGISDPSCCF/RSAPTAD 
QVYGDQDMHEWRKHCMDYLVRKWRPLG 
KGQGGKISSPHRP 


1964 


15865 


A 


1978 


357 


2 


KQILGPPHPQAQPGRWPPPHGPKDAPL 
WSSRAAPPGRGLGRAGPAAGVEAGATLR 
DSSPSTWTREGLHVQAQRKRPSHVHKG/ 
SGPGCLEDGEFPTSLRLQAQIiAE IGRGN 
GLSVRRQ 


1965 


15866 


A 


1979 


29 


434 


VQAEAEGLPGDTEHPQPQLMSRSLEGQS 
DVTIKHVACGDFFTACIiTDRGI IMTFGS 
GSNGCLGHGSLTDISQPTIVEALLGYEK 
AQVACGASNVLALATEREL FALGRGDSG 
RTGARTKESH/YLPQQVPMP 


1966 


15867 


A 


1980 


3 


2082 


SSEGYLRGNMSENEEEEISQQEGSGDYE 
VEEIPFGLEPQSPGFEPQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQSPRYEPQSPGYEPRSPG 
YEPRSPGYESESSRYESQNTELKTQSPE 
FEAQSSKFQEGAEMLLNPEEKSPLNISV 
GVHPLDSFTQGFGEQPTGDLPIGPPFEM 
PTGALLSTPQFEMLQNPLGLTGALRGPG 
RRGGRARGGQGPRPNI CGICGKSFGRGS 
TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
DLIKHQRTHTGERPYKCPRCGKAFADSS
YLIiRHQRTHSGQKPYKGPHCGKAFGDSS 
YIiIiRHQRTHSHERPYSCTECGKCYSQNS 
SLRSHQRVHTGQRPFSCGICGKSFSQRS 
ALIPHARSHAREKPFKCPECGKRFGQSS 
VLAIHARTHLPGRTYSCPDCGKTFNRSS 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLIiQHHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


1967 


15868 


A 


1981 


2 


188 


LPETNFAELFLPYISQHNLIRKYKK/WP 
6AVAYACNPSTLGGQGGWITRSGDQDHP 
GLHEEWP 


1968 


15869 


A 


1982 


3 


424 


egqaivermnlclkqqlqkqkgenryyr 
tphkqlnXalltlnflslpkgrilsaae 
QHLQKPAAKTEAEQIiVWWRDLITESWEI 
GKI ITWGRGYAYVS PGL/NPSRHLKPYH 
ERMLRKRFREDPEDPPSCSHVKTDAEED 
PN 


1969 


15870 


A 


1983 


3 


399 


YSKLSFKGTLTKFRRIYSSSFYKBFQGC 
ISDLCHP/R/TLTNCLRGELLKENLVWG 
AFGCHPHFPCYYNKSQERNLIiQALRHPT 
TVAFGETGFNYSYKCTMPVPEHHK7FKR 
QLELAASIiEK/PIiVILCQQADE 


1970 


15871 


A 


1984 


1 


405 


RRHIGGGVRLYYIGGEVFAKSLSDSAI F 
AQTPNCNQRYGWHPGTVCKI PPGCNLKI 
FNHQEYADLLDQSVNQGLEADY/QI.TRM 
CTILMSLIiKGWGAEYRRQTATRTPCWIE 
LHLNGPLQRVDKVLTQMGYPS ILM 


1971 


15872 


A 


1985 


27 


452 


QGREHAQGGQSPGAGHIiGPTPEPQPEPQ 
PRPSSQAVPAGRWEPAQE/PTRHPHPRL 
SPASRPLGSPPAPLVRSSPGRCRLHEHT 
IWSSTVGTSVEPAPSLGRPQAPLEPGTV 
TSSVRLQQPHMHTPGKIMPDPSKRHGKF 
TFT 


1972 


15873 


A 


1986 


414 


220 


GAEQEELLSP/GSGGCSELRSCHCXPAW 
ATRAKLRLTKQNKTKRESYRQGNQHWGR 
ILGELLGRL 


1973 


15874 


A 


1987 


52 


412 


TRERKLFFACDNVWKHLKRYLRKNSFGE 
NliWSSRNIKKKKKKKKKKKKKKTKNFST 
PKNFFFFKACQCMGKKNTKHDFHEQNF I 
EHALLQQLNINFSTNYIEGRVF\HPGAP 
lESLLLMCH 


1974 


15875 


A 


1988 


2 


143 


EKRRGLDKRTP/AQAAFEKMQEKRQMER 
ILKKASKIPPPFVCMWSVDS 


1975 


15876 


A 


1989 


3 


163 


THFQPSEK/WGEDLGDNTWEYIFAIDLL 
CCHQKW I CHPLFIiVG WRAGAE VS GVF 


1976 


15877 


A 


1990 


1 


439 


DKTAAEDAIRNLHHYKLHGVNINEEAST 
NKSKTSTKLHVGNISPTCTNKELRAKFE 
EYGPG\ lECDILKDYAFVHMERAEDAEE 
AI KGLDWTEFQGKRMHEQLSTSRLRTAP 
GMGDQNGCYTGGKDGHWSKECS IDRSGR 
VADLTEQ 


1977 


15878 


A 


1991 


1 


145 


WAASKAMK/MGDWKTCHSFI INEKMNG 
KVWDLFPEADPVLLKRLRESR 


1978 


15879 


A 


1992 


2 


425 


NISTLKKTLESDCT\KLFSQGIGGEQAQ 
AKVDRCLSDLAAETNKFRDLLQEGLTEL 
NSTAIKPQVQPWINSFPSVSHNIVEEEF 
ND YEANDPWVQQL I LNLEQQMAEFKASL 
SPVIYDSIiTGLMTSIiDAVELEKWLKST 
FN 


1979 


15880 


A 


1993 


3 


449 


vagpapga6arpgldlqplqrflq ilkv 
lfpswssqnalmfltllcltllXlksfd 
qftcnllyvswrkdltehlhrlyfrgra 
yytlnvlrddidnpdqri sqdlerfcrq 
iissmaskli ispftlvyytyqcflstgl 
lgpvsifgy 


1980 


15881 


A 


1995 


1


410 


SSRRPFIALKTKSMRDLKTPEDIDQLITI 
SGMVIRTYQLIPEMQEAFFQCQVCAHTT 
RVEMDRGRIAEPRVCGRCHTTHSMVLIH 
NRYLFSDKQMIQLPESPEDMPAGQTPHT 
VIIiVAHNDLDDR/VQPVDRVNDSGFF 


1981 


15882 


A 


1996 


1 


154 


LFFFRLLVRYTKKVAQVSTPTLGKVSRN 
hGKVGKKSm<MPEAK\l<RPCGm 


1982 


15883 


A 


1997 


2 


475 


KKLRRQTRREAQKE \LPDKV\RWGLVTP 
SETPVGrSNLSPFLGPRCL/QNPAELKP 
PWRFPWAQRPKAH\ ERANAGPKTPAEQR 
KVKKIKKLKEDISQGVHISVYRVRNIjSN 
PAKKFKrEANAGQLYLiTGWVIjHKDVNV 
WVEGGPNAPKT I TRLMPLRI 


1983 


15884 


A 


1998 


3 


431 


QLRTRDRGWPSRRPEREKRTSQSARRPT 
CTESRWKSEESVESDDEYLALPARLTQV 
SSLVSYLGSISTLVTLPTGDIKGQ/SPIi 
EVSDSDGPASFPSSSSQSQLPPGAALQG 
SGDPEGQNPCFLRSFVRAHDSAGESSLG 
SSQA 


1984 


15885 


A 


1999 


1 


400 


ALDLRGIjQILiVGFPKRRVTTCSYPTAIiQ 
SP IEYQRKERSTAA7MRTEPDSAYQASPR 
PYSAGPADSKKPTKGYCYNPTLP\RLE1 
MTLEGTTG 


1985 


15886 


A 


2000 


1 


372 


QNIDLVISFFSSRIjIiQAGAELSVERVLE 
IIKQGWALPKDRL/RGSCAPTLSAAGR 
SSGGQSPCMPGLCVCSFWVLTVSWLVCQ 
KFPELKFKYVEEEQPEEFFIPYVWSLVY 
NSAVGIiYWNPQD I 


1986 


15887


A 


2001 


393 


1 


GGIGRGGGAGGGVGAAGSASGGVGRRGA 
GGVIADSGAPGGGVEGGVGASGGWRE /G 
RGTSGGVGGSGGACGSV/GGSGGAGGGV 
GACGSTSDGVGRSRGTIGGLGGSGSAGG 
GVGACGGASGYVG I RGAGGG 


1987 


15888 


A 


2002 


2 


362 


WVT F I S LLFLFS S AYS PGVFRRDAHNS E 
VAHRLKDtiGEENDKALLLIAFAQYLQQC 
PFEDHVKLLNEVTE FAKTCVADESAEDC 
DK\SL\HTLFGDKLCTVATLRETYGEMA 
DCCAQQEPE 


1988 


15889 


A 


2003 


2 


358 


EANRGWFIRLKEGSQLYNIKVQGEAASA 
DVE AAAS YPEDIiAKI TDEGGCAKQQ I FN 
VDK\QTAFSWMKRPCRTLIAREEKSVPG 
FKL/SKWRLALLLGANAAGGFKLKSVLI 
CHSENSRTF 


1989 


15890 


A 


2004 


190 


1 


DQTCFLSFTVKAVTFNGWVWWLTPVIPA 
LWDYRH\RPANFFVFLVETGFHHVAQAG 
LKLLGS 


1990 


15891 


A 


2005 


1 


132 


GMCHHAQLIFVF/CSRDRVLHGCSQTPC 
LKQFSGLGLPKCWDYR 


1991 


15892 


A 


2006 


2 


134 


PMTFFTELEKTTLKFIWNQRRIQIAKAI 
IiTNQK\NKARGITNIC 


1992 


15893 


A 


2007 


315 


127 


SEIAFFFCSFLKIILDTR/FSFFARAGL 
KIiLASNDLPSSSSQGAGITGVSYGTQPV 
CPEYNV6 


1993 


15894 


A 


2008 


3 


325 


RCSMLAVREM/QKATKRCHFIiPTRIALI 
K/ND/GSNKCWK\HCWWECKMVQLLWKI 
VWQFLKKLNIELPFDPEIPLRDIYPKQL 

KTYVHTKTCGQMF I AVLF I LNVPHLMNV 


1994 


15895 


A 


2009 


369 


3 


VGQAGLE FLTSGDP PASAS QSAVI TGMS 
HHTQPIFCIFGFA\GCPDWSQTPELKQS 

ahlslpsswdyrcmpphlanfyfcrh/r 
vccpgwsqtpglk\yppalvsqsvattg 

tsprawqdtilpv 


1995 


15896 


A 


2010 


114 


287 


APLCLCLRHLRLL I KRLTVLGTE AHTL \ 
NP I TTRGRGGQ I T WGREFE TS laANMVKP 
CF 


1996 


15897 


A 


2011 


218 


346 


IMKFNLLKFYFIYLFSRQSL/SSVAQAG 

VHWRNLGSLQPLPPKF 


1997 


15898 


A 


2012 


310 


1 


SSSSSNTHFGIPKYIjINPDTCFIiA/KVN 
NSSLTGLEYTETLKPGIK 


1998 


15899 


A 


2013 


3 


301 


SDCCASNQRDSGGVGPSEPARKHTLCV/ 
CNSLDLIE


1999 


15900 


A 


2014 


332 


3 


LRDPLEEAVCPFSDLQLHAGRTTALFKA 
VRQGHI.SLQRLLLSF\VCLCPAPRGGAY 
RGRQAG\SLSCGGIiHPVRASRLLCLPKQ 
AWAMAGAPPPAWPRP CSL I SD CCASNQR 


2000 


15901 


A 


2015


2 


325 


LLLHHAPPVNLFLRDRVSLCGPLCGPGC 
SQTPGLKQSSCLSLPKC/WDYR\RATAP 

GL 


2001 


15902 


A 


2016 


373 


3 


WAHGLQPGKP KVD S LSKKRKKKKGKKLN 
TFPLRSRTRQACLLSPLLSNIVLEVLPN 
E IRQ / Q KKE I KGTHTGREELTLSLFTDN 
MI I YVD I PKQSTKKNQGSYSVARPGAVA 
HACNPSTLGGLGG 


2002 


15903 


A 


2017 


343 


1 


EFPFVSGSRATGKSSDIRATKYIWRVLE 
YLRAWPRGQRRLKSS\HTSLLGSYHPGA 
PRGDKWSCCHQKDETGGRGEDEVLLCCP 
GVLGCSALCRSGVRTKFGIRGRPWKERE 
R 


2003 


15904 


A 


2018 


139 


2 


NS I S TKNTKN^SGMVAHAFNPS TLRGRG 
GQ I \ RGQDFKTS LANMVKP 


2004 


15905 


A 


2019 


3 


322 


ARELVFFFGAYRKGFFHRDMKPKNLLCM 
GPKLGKIADFGLARELRSHPPYTDYGST 
KWR/YRGNPALLRPTPMAKFPYEGWLHI 
SGSSIWNIKTSDSNFIFFKESKG 


2005 


15906 


A 


2020 


205 


377 


NIVENIVFCWPGVCFLQTCTVCINPETS 
DE/WPGAVAHACNPSTLGGQDGQITRSG 
DRE 


2006 


15907 


A 


2021 


3 


324 


kkwgkrxlnrapngrrypetrwaleedq 
chickelhiktvrfhctpirmakihttd 
npqswprcgtgtlihcrrgcktv/r/pl 

WKTVRQ / FL/RKLNI PLPCDPAVLSL C I 

Y 


2007 


15908 


A 


2022 


2 


382 


RVSQDGLNIiLTSRSTRIiGLPK/ CWDYRC 
EPPSP 


2008 


15909 


A 


2023 


1 


421 


RWNPGGRGCSELRLHYCTPTWVTERDS I 
S KI AKNK/NNKRPRlsnsrCR 


2009 


15910 


A 


2024 


339 


3 


SWDHRRVLIFVFLVQTGFCHVGHAGLEL 
LTSGSQSAGIAGVSHRA\GQKHQFRPEH 
RFLKFGFVFRDRVSIiCCPGWPQTPRLKQ 
SSRLSLPKSWGPATALDPDSSYLEEMVL 
S 


2010 


15911 


A 


2025 


2 


146 


NTFGRSRQEDHLGPGGQACSELRSHHCT 
PAW\VIEQDPVSKKKKPPKP 


2011 


15912 


A 


2026 


3 


380 


RLE CS GGI S IHCNIiR / IxPGFKRFS CliS li 
PSRWDYTRLP/PFVFLVETGFHHLGQAG 
LELLTSGDPKCWDYGC\DHCTWP 


2012 


15913 


A 


2027 


32 


296 


DYiyiNSLMYFHSLVLASTDEGFLPKTVST 
QSAGITGISHCARPWIPF/CFFFPKNRK 
TRFVAQAEGQGGNFGSmPLPPGFRGFP 
CLSLT 


2013 


15914 


A 


2028 


2 


187 


FTLLPRLECSG\MILVHCSLNLPGLRWS 
S CLSLLSS WNYTCVAPFS I FY YFI FLTW 
GGTLNC 


2014 


15915 


A 


2029 


197 


3 


AAGLSQAGMQWGDFGSREHIiPSRFQRFF 
CliKVPNNWDYRHGPPRPVGI FFFSF\LV 
ETGFHHPGQ 


2015 


15916 


A 


2030 


88 


362 


KFGINLKKKEKKGPQKNQNFFPTFLFFF 
FFETESHSITQAGVQWRDLSPLQPLPPG 
FKSLPSSWDYRCLPPCPANFCIF\IPDS 
VSLCWQGW 


2016 


15917 


A 


2031 


7 


131 


GTIMAWAPGEGGCSEPRSCHCTPAWVT\ 
SETPSQKKKKGVLIi 


2017 


15918 


A 


2032 


1 


343 


LECSEPRLCHCTPAWV/ TGDSVSKKKKK 

KKKK 


2018 


15919 


A 


2033 


243 


14 


QKXiAPWPPQSAGNNRRVPPHPASMAFLT 
KIEKTALKFIQNHKRPQIAK\TILSKNR 
VGS ITLFTTS KIPfYMTTVIKTI 


2019 


15920 


A 


2034 


238 


3 


FLALPKVLGIIGCVPPHPASMAFLTTIE 
DTALKF I QNHKRP Q I A\ KP I LS KNRAE S 
I TLFNS KIHYMPTVI KPLR VTS T 


2020 


15921 


A 


2035 


345 


2 


liVKI QKEMNVFMTANP I S I PQFMDQGVI 
S S CKS YYFRNKFLYVV lAAMDTD / S SNG 
SWQSEWKTFWKGFI ILDGIKNTDDS WEE 
VKI STLTGI KKKL I PTSTDDFKEFKTLV 
EDVT 


2021 


15922 


A 


2036 


1 


140 


GRCC \HELRS RHCT PAWATRAKLKKBCKK 
KKREKKTQKGKNGLGFWAF 


2022 


15923 


A 


2037 


199 


348 


RSSNEGGRDCVFCFGDRVSLCLPGWSAV 
AQLWLTATSTSQ\VKSLSHLGIi 


2023 


15924 


A 


2038 


3 


193 


NGUSTAP I KRLRLADTW I KSQD PS VCCIQE 
AHLiTCRDTH/RCYLKG/WYKAF 


2024 


15925 


A 


2039 


116


337 


S KLLKI LPRLCWGWWQAPVI PATQE VEA 
EDHLNPG/RSRPAWATQRTP VS I KKKI 1 
LRPANGK/CHGPSW 


2025 


15926 


A 


2040 


368 


252 


VFFFFALFYIjFFFFFFFFFFFFFFFFFF 
QFLGFYFRFKF 


2026 


15927 


A 


2041 


204 


399 


VSSHKINGIiTVCSTSPFFLSLLPPSEES 
ACFPFAFCHDCKFPEAS /SVMLPVKPVE 
L 


2027 


15928 


A 


2042 


103 


356 


WHFSPPQPLPPP/PLPNPPPPPTPPPP\ 
PPPPPPPPPPSPPPPPPPPSPPPPPPPL 
PPPPPPLSPPPPPSTPP/PPPPPPPPPP 
PLPP 


2028 


15929 


A 


2043 


3 


344 


LYKWNNKASLAAHLFAAWFTVYFKPTVE 
TYC/SGKKKIPFKILLLIDNTPCYPRAL 
LEMCEE IS I VFAPATTTSS /liKPMDQGV 
IVTFKSNYLRNTFQAGGGEKKEKHERKK 

Nirs 


2029 


15930 


A 


2044 


2 


349 


PRVRKSPGPNGFTANFYQTFKELISILL 
KLFQKKKKIKKGENPPNSFYGAIIPRIP 
NPNMDLSKK/ETYGPVSGRNMBAKIFTK 
FIiAGHFKQSFGREIHHDQREFIPGIQGG 
FNIGN 


2030 


15931 


A 


2045 


280 


462 


CXFFLVWLVWCWLLFXVWLWLCFFG 
FWCCWFCXCGVWFFVFFWCVLCGVL 

LGCWC 


2031 


15932 


A 


2046 


3 


284 


PSPSFSLLLPPSFSLLFPPSF/SPPPPS 
FSLiLLPPSASLLLPPCTSIiLiIHPPTSLQ 
LP PLPS F YLIiLPTS I SHHLPTHNLPTTS 
IQDPSTPCSIK 


2032 


15933 


A 


2047 


95 


1 


GRVDAIIoAHCKLR/ LPOFTIiFSCrCSLPS 
TWDC 


2033 


15934 


A 


2048 


211 


1


EPTTMTGAKWGSSQTTTWYHMLQSLRIN 
VRVDPFPFFCTDR/SLSMLPRLVLNSWI 
PAI ILPHPPQVLGLQ 


2034 


15935 


A 


2049 


35 


266 


EVRRPSTWQPPRLRSEEPLRL XQPPRLG 
NIGFCHVGHAGLKLLTSSDIiPTIaASQSA 
GIT\GHSYRRASEEDKKESSMSS 


2035 


15936 


A 


2050 


281 


3 


QSNFKRAL.LS ILIiKAIYRSNI I PIK/ VP 
MTFPTEI /EKVI PKFIRNLKRLRIAKVI 

lskkjtktrritlsdfklyyyX IAVPVAA 

WYWKKNRLSDQWTT 


2036 


15937 


A 


2051 


251 


12 


ilrfflpkflgfqkppslekthfflpaf 
gnslfwppeklgqkkvfffffffflrqg 
l/nsvsqagvqwhnlgllqsplprlr 


2037 


15938 


A 


2052 


2 


325 


adhlrpgvqdqpqpgqngktpsplkiek 
\lagcggghprlreenclnpggrgcsep 
rsrrctpaw/vndsktlarkkkkkkgve 
knecgrkvmrv/lqgpkakvkpwgenl 


2038 


15939 


A 


2053 


3 


166 


slllprvecngaisahcnlhl\hnlhlp 
gssdspasasqvaevrgsleprsssla 


2039 


15940 


A 


2054 


227 


3 


LNENIGRITGMSHHARLILIiFCE/YRVS 
ITQAGV/QHDPGSLQPLPRVFKQFSHFS 
LQSSWDHRCAPIRIiAIFWVFC 


2040 


15941 


A 


2055 


301 


0 


QRKSHMFLTINQKDEMIKLSEEGMSRVE 
TGQKLDLMC / QVS QAVNAKE KFLEKI KG 
DTSVHTQMIRKQSSINVDMEICVGIVWIE 
DQAP/HNHIPLSHLLMRAR 


2041 


15942 


A 


2056 


1 


109 


RPLRRLRQENRLNRGSRG YS E P KLC/ HL 
CTPAWAT 


2042 


15943 


A 


2057 


313 


250 


PQPPPPSPPPQPSPPPPSPPPPSPSSP/ 

PPP 


2043 


15944 


A 


2058 


134 


2 


ekesrsva\rlkgsgaisa\hc]srl\clp 
gssnsppsasrvagssga 


2044 


15945 


A 


2059 


120 


326 


nslvadme/kvlwwiedqtshniplsq 
s/liqskaltlfnsikakrseeaaekif 
easrdwfmrfkikqk 


2045 


15946 


A 


2060 


102 


413 


ertgfrhvgksglefmtsgdpptiasqs 
vgit/vheprtrpg 


2046 


15947 


A 


2061 


311 


1 


ffkkqrfcksggktpgkprgpkklganq 
fsgptpi»kngvgpspgpkggffppppgg 
fpagrpkwltrilgkggsllrg/rypkk 

GFLKP ILGDNSPQRAPKRGP 


2047 


15948 


A 


2062 


229 


380 


WYDLGSLQPMPIiRFKQASRWLPRSWDY 
RHP PLSRLNX I VF I VDTG I LHVG 


2048 


15949 


A 


2063 


142 


383 


PQSCFSTHWQLLQKQEETAGAVSVCVCT 
S \VCVCVCVCVCVCAGAMCVCAGA/CFC 
VCVCAGA/ CLCVCVGA/CLCVC 


2049 


15950 


A 


2064 


408 


200 


NLIQIKAIiTHFSSIKAERGDKSTS/EKF 
EGSRGWFVRFKERGHLCN I KVKHEAANA 
YAEAACSQLSRRS S 


2050 


15951 


A 


2065 


348 


3 


WVSPYSPCVCVCVCVCVCVCVCRDWVSP 
CCPGVCMCVCVCVCRDW/CFTM/ CAQVC 
VCVCRDWSPCCPGVCVCKDWVSPCCTS 
WS ILKLLSSGNPPTTVSQSAWITGRSHS 

AWPARA 


2051 


15952 


A 


2066 


73 


286 


NLIRGLLESHILISMRYGYCKSYTLSMR 
DIPEALNKWKS/IPMFCSWSRRLIVSMA 
VLSKLINRFNAI<CDVQH 


2052 


15953 


A 


2067 


329 


0 


PSSTFLLPFPRPPPSPPPPPPPPPPSPP 
/PSSSSPAPSSPHP 


2053 


15954 


A 


2068 


221 


3 


ELKVI LHCLRDFS LQS S IMKVLILKI L I 
LSVCCVC/VCVCVCAVCSCVG/VCSCVC 
/VCSCVCXjCVLCVCVCVLSC 


2054 


15955 


A 


2069 


1


167 


GTREWPLNPGGGGCSEPR \ S CHCTPAWA 
TKSETLSQKKKKKKKISNYKTPPKSYRI 


2055 


15956 


A 


2070 


2 


354 


ARACLGIiPSSWDP/ SVENRPHRVGQAGL 
EIiATSGDPPTSASPECWDCRH\DHHTWP 

LL 


2056 


15957 


A 


2071 


20 


341 


CIVTVNTRGENINICWSPDGQTIAVGNK 
DDWTFIDAKTHRF/LKQNSSSS/SEVN 
EISWlJWDNNMLYPDTGNGClMIIiSYPIT 
ESRAIYQRPIFHVHVNQDLPHGDVLS 


2057 


15958 


A 


2072 


353 


58 


LQLLTTSDPPASASRGAGIADGVWFTQ/ 
SLNGAQAGVQWRDLjGSLQPHPPSRL/LL 
ASQSAEIAASARPPPRLGSEERLCXjAAH 
RLGCEEPLCLAAQSGK 


2058 


15959 


A 


2073 


1 


338 


GSRLQRVCINYRLSFFPFLSQGWINFTW 
LFCLCVCFLRDRVSLCCPGWPSTSGFKR 
S S C P S LLSRWD YRHMPQHLASHTLFKKIi 
/TILPR 


2059 


15960 


A 


2074 


1 


322 


GGGREAGEARGGGEGGQGSGRRRRGRGG 
PRTGGAEGGRGAGETPGGGARPEREQGR 
GRHSERQGPT/RQTKRPKTKTKQLSQNK 
NT 


2060 


15961 


A 


2075 


450 


225 


TPVGRGCSELRSCNCTPAWVTD\ETLSQ 
KEKRGGVKIGWKKRRTRI I SLHLPGSHE 
KFNNLETVKNCVNHFCRLNT 


2061 


15962 


A 


2076 


2 


470 


TPQNKPHPTTKNTQPQPTQKK/ PTQTQP 
/TTPKPKTKKNTTPNPAPPDJTQKNHTTK 
PTTQNPHKTKQKKTKTKPNTTQQQNKPTN 
QNPKTQTQQKTTTQTKPPKKKKKKTADT 
TSPNPISTKKIKKLAEP 


2062 


15963 


A 


2077 


304 


1 


NSATPPCSPTA/KPHPTPPTPPPTSFIP 
TCQHSPPTKICPQARPRTPPYAHPPRCP 
RKI I PKGRRHPIjAPPQAARRDLNYYHP I 
IWRGRVCVCVCVCVCARA 


2063 


15964 


A 


2078 


167 


1 


TILQTNSTWSNVLLWQGAVAQAC \NS ST 
LGGQGGRITRSGDRDHPGQHGETLSRA 


2064 


15965 


A 


2079 


3 


364 


HETRSRHSCAWCSAALWRAAVASRCPSS 
/ 1 PVTPPQCLYWP / wkvplqcppdl 


2065 


15966 


A 


2080 


303 


2 


SKRGRPSGHECPFLGTSSSCRHVASCTI 

RTPRRLCKSQLDRCSPLKERHKFGLSRS 
VLSAMTQSGIY/WQPPPPEFKRCFSCLN 
LLSSWDYRHAPPRRARA 


2066 


15967 


A 


2081 


4 


326 


AGITGMSHRAWLFLYFLNKFAFT\YGLV 
LNFFLHKIQEPSLG\SGSGP/LSCNS 


2067 


15968


A 


2082 


1 


343 


pppppppppppppppppppppppppppp 
pppppppppppppppppppppppppppp 

PPPPPPPPPPPPPSPPPPPPPPPPP/PP 
PSSPSSPPPPPHTPHP 


2068 


15969 


A 


2083 


330 


88 


ACDRSriSRIHKEIi\EQLCKQK\SNNPI 

kkwpicdmiryfskqdvqtvktmnkcsts 
liiiemqirstmryylttvrtphps 


2069 


15970 


A 


2084 


1 


340 


RVRSHGTTHIiAQL I FVLL VQTGFHHDGQ 
DGPDLL/NLVIRPPQPPKVL 


2070 


15971 


A 


2085 


2 


348 


ARALSLGANAAGNFKLiRPLLRYHSQNPR 
ALKNYAKYTLPVLHRWHSESLETAFTEC 
FTEYFKPTVETYCSEKD/I /SYKALLLI 
EKALGHPRALMEMYKDIS I VFMPVNTTS 
ILQPMD 


2071 


15972


A 


2086 


369 


2 


KGIiPPSPANLFFLSVY/MGFPMFTRLIL 
ISCPCYPPPSASPSAGIKGLNPPGWPPF 
SFFYQKFVRFFVLARNRTFGFVDYFS IL 
YLIYGCSNIiYRFLLSALYFFVSVFVFVL. 
TQGIEAMVPRA 


2072 


15973 


A 


2087 


314 


3 


L.LESVGPTRNSRPFKGLIGRILLDPEFY 
PSLVSDDLPASASPNAGRILPFFFFETG 
SHS\TQAGVQWRHHGSLQ\LKRSSYLSIi 
TSSWVYRHTPPHPANICIFSRA 


2073 


15974 


A 


2088 


330 


16 


CPCFFIiSALSVLVGWCFAFWCGCV/WW 
CVCFWVFVCVCFLGCWLCLCFV/VCIi 
VGVCCWFFGGLCCVCC 


2074 


15975 


A 


2089 


1 


337 


GTRTFLPPSYKDPCEY/IWAHPDNPGSS 
SNCNMLNFSTSD/PPVHSSGNWKVLSSP 
NRPYYYSYTATPHTDPTPHLPSPNPSSP 
SPSYPLIiSDSTICQTTPTTTPITSSHTL 
liTS 


2075 


15976 


A 


2090 


350 


84 


QRKENKRTRKRATERRDESREEKAGRKE 
GENQE KRNKEQQGRQRRSRDRT / E EE KE 
EAKRREHKNPKKKKTKPPPQKKKKTEKIC 
KTDNLSN 


2076 


15977 


A 


2091 


1 


355 


SDPPTSASQSAGITYVSHRTWPLLEFSG 
TS IRLAGKPAGVLVE VTGK/ S VCGGGVT 
KTHWNECHTGYPKCCWSSQAGESSLQPP 
PPGFKRFSCLSLPSSWDYRLLPP\QNFC 
IFSRDGL 


2077 


15978 


A 


2092 


27 


345 


AS I PCLKKRKKKKKKTGGKP PP \GGPKP 
KRGGGGGPPPTKRTLFGPPQGNQTPGGV 
S/GPPKGDPPFSPIPPCQRKSPPFEKGG 
KKGPPPWRGVKREKGHFLKNFQQK 


2078


15979 


A 


2093 


3 


301 


HEHVAQAGLKLLGSSDLPTLASPKCWDC 
KR\D YCAWPHI FP I SGPLYVFFPLPRMS 
PHTHTHKYTRTFFHYAHKHCACTLNIjLL 
RAQFQCHFFKGWPDP 


2079 


15980 


A 


2094 


76 


342 


wfxxfffllfvffvfwfvcfcvfcgfff 
fcffccvwfvfgfclflcflfwfcffvf 
pgpfgfwffvfflvcflvfvcffgcfff 

LFCFF 


2080 


15981 


A 


2095 


115 


325 


mderkkirgggrqgkeckihckkklspg 
irsypven/f/vdtmydylqpayyklnd 

LTNADPCAVRYLLFDQN 


2081 


15982 


A 


2096 


80 


227 


S CliGN/ C I HLYSHS PTLSFTHTHTHTHT 
HRERERERERERE I CICMSVYA 


2082 


15983 


A 


2097 


308 


1 


NSTVTMENSVNIHYRTRVFTEAQFTIAK 
SWNQPKCPSILEWIKKL\WIYIYVCVyi 
CVCVCICVCVCVCVCVCVCIYIYMMEYY 
SAIKRNELiAAFAVTWTRA 


2083 


15984 


A 


2098 


2 


361 


ARACLGLPSSWDF/ SVETRFHRVGQAGL 
EIiATSRDPPTSASPECRDCRH\DHHTWP 
LL 


2084 


15985 


A 


2099 


1 


221 


LliWRLRHENHLNLGGRGCSEPRLHHCTP 
SWMTR/GKTPSQKKKTKQPMEWQITLCLK 
KVFGKTGVREIFNGIKLS 


2085 


15986 


A 


2100 


186 


2 


ERVYVCVCVGVCVCVYQCYLPQIRSIi/V 
ASSLKANARVCVCVCVCVCVCVCVCVCV 
SLWHPLTA 


2086 


15987 


A 


2101 


305 


2 


HNILLSQSIi IQS KALTLETSTSMKA/ E/RG 
EEAAEEKLEASRGWLMRPKE\RSHLCN\ 
TKVQGEVASADTEAAPSYTEDHSKITDE 
GGYTKQQIFNVDKKASYWNMS 


2087 


15988 


A 


2102 


6 


166 


eqtale i larairqekeikgi q irkeev 
klcffvndXmvlylenskdflkvdaa 


2088 


15989 


A 


2103 


53 


176 


E I KNNRPGMVAHAY / NPSTLGGRGRQI S 
WGQEPETSIiVWiyiVK 


2089 


15990


A 


2104 


3 


267 


FRHVGQGGLKVLTSGDPPALVSQSAGIT 
GVSHCAQPIVGDFNTPLSIL/D/RSTRQ 
KINKDIQDLNSAIiDEADLLDIYRTLiHPK 
/STEYTFF 


2090 


15991 


A 


2105 


268 


3 


PPKEHGSSPATEQSWMENDFDELREEGF 
RRSNYSE/LREDIQTKGKEVENFEKNLE 
E C I TR I TNTEKCLKELMELKI KARELJiP 
ECRSLR 


2091 


15992 


A 


2106 


3 


170 


GFHHVDQAGLELLTPQVIHPXLGLPKCW 
DYRREPPCIiASPHFHQIAISQKRHREAK 


2092 


15993 


A 


2107 


1 


398 


SARGPDGFTAEFYQTFKEELVQILLKQF 
QRIKGE I LL/KNHYVKPS ITL I P K\ PG\ 
RDITKKLLTRSFCVSLCPVLSPLQSLQS 
RPSSLSMISLHSVCFSVLSASAHPHVHL 
CPTCPTLVLSGSHCVCCVSLFFF 


2093 


15994 


A 


2108 


3 


370 


HENWNNKGWGDSTIYSMKYLYFKPMLRP 
Y/C/SQKKIPFKIIiliLFDNAPGHPRVLM 
EIMYKDMEVF\MPVNTTF/ILQPMDQRV 
ILTFKSYYLRNTFHKTIAAINSDYSDGS 
GQSQLKTFWKGFIVL 


2094 


15995 


A 


2109 


1


213 


HFPVENESAPG/FKAAGDLLTLLLGGNA 
AGDFKLKALLiVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2095 


15996 


A 


2110 


391 


3 


KKKKKNPHRKKIKKRWKKIFNAERNKKR 
AGVIIHILDRLDFKQQTIRRDKR\GYYT 
M INGT I QQE VI T I LN r HAANTKAVRYIK 
QVLLKLKTELGPNTI ITGDTNTLLLSTL 
NRSSRQKNQQTLDLICNI 


2096 


15997 


A 


2111 


188 


365 


FQNTIHI CVCVCVCVCVCVCL/CVCFCV 
FFFCLLCGGFMCGCWCDFCILFCFYGVG 
FFFL 


2097 


15998 


A 


2112 


3 


340 


RMESALDRLKQFTTWGDTGDFHAVDEY 
KPHDATTNPSrilliAVAQMPAYQELEEEA 
lAYGRKLGGSQEDQIINAIDKLSVLLGA 
EIIjKMITGRVSTEVDARLSFD/SDAMVA 

TA 


2098 


15999 


A 


2113 


11 


305 


FIiFTDFCLFMTHIIiGHKINYITN/ CKRN 
VI IT/SYFSPHNRIKLKISIRKISRKSS 

ntwklnnrllhypq i kde vsre irkyle 

lninentnf/qnlwdihk 


2099 


16000 


A 


2114 


3 


387 


QTNH/NIPLSQNLIQSKAITA/NCMKAE 
RSEEAAGKFEASRGWFIVIRFKKSSYX IKV 
QGEAASAGVEAVAIYPDLAELIDE/GCY 
TT / Q I FS VFQTAFFWKKKPSRTFM/ REE 
KLIPGLKASKDSSSLLLRVHAAGD 


2100 


16001 


A 


2115 


152 


393 


VYCPICWLVFFFCFCSVLIIjFVMFV/CIj 
CFSFFCFLGFWVFSSFYLFICVFFFIV 
LGCLFLCWFDLFLFVLFVLCLFFVS 


2101 


16002 


A 


2116 


2 


373 


ARACLGLPSSWDF/SVENRFHRVGQAGG 
ELATSGDPPTSASPECWDCRH\DHHTWP 
Lli 


2102 


16003 


A 


2117 


3 


449 


HEFDHAMLQAHRAHHLAIDAYHEPEETY 
IPKDQKYSFLHDSQTSPCFSDSIPTPFN 
MEETQQKSNLEIiLRISLLLIESWIiEPVR 
VLRSMFANNLVYDTSDSDDYHFLKDLEE 
GIQTLMGRAEKRHCRTVQNLKQTYRR/F 
DTNS/HNHDALL 


2103 


16004 


A 


2118 


146 


15 


FFFFYFFFLFFFFFIPPLFFPFFLFFFF 
PFYFIFIIiFIYSLLVF 


2104 


16005 


A 


2119 


406 


3 


liFSVNETGFYSKMLSRTFTATEETSIPG 
FKVSKDKLTL\LGANAAGDFKLKPMI.IY 
HS /ENPRALKNYAKFTPPVLYKWKNKAL 
MTTHLFTARYTEYFKPTVETY/ ILLLID 
NAPTHQRALMEMYKEINVFMPAMTTS I 


2105 


16006 


A 


2120 


109 


396 


YYFYFYLVLFYFFIFLFFYFFFFFILLL 
FYPPFSFlFFFPIiFVYPIFLCYFFLFFF 
FFFFYLFFYSFFFFPIFLFPPFFFFFFP 
SYYLIFFLSYIl 


2106 


16007 


A 


2121 


15 


413 


I VLARNTNF WL S FL F P VALG I L I VLiKGV 
KYIFWPLEYCQRLKMFVSYSFHY\FFLG 
SLLFLKYGFHMYLILBL/CIFIIIMCFF 
I KYS FFFCCIiYHFFFS FYLFFXiYFLI F\ 
CYLVILFFSFLLFLFLSSYCFFF 


2107 


16008 


A 


2122 


1306 


429 


SSSSSSSSHVLRIIKDEDFKILEQRQW 
LSEKEAQALCKEYENEDYFNKIilENMTS 
GPSIjALVLLRDNGLQYWKQLLGPRTVEE 
AIEYFPESLCAQFAMDSLPVNQLYGSDS 
LETAEREIQHFFPLQSTLGLIKPHATSE 
/HKRGPSMVMILTKWNAVAEWRRLMGPT 
DPEEAKLLSPDSIRAQFGISKLKNIVHG 
ASNAYEAKEWNRLFEDPEEN 


2108 


16009 


A 


2123 


3 


206 


LRRLRQENHLNWRGGGCSEPRSQHCAAA 
W / VSNS ETPQKKKKKKEKKKKKKWLPNS 
ALKKTYSQRGKLF 


2109 


16010 


A 


2124 


23 


401 


IASGRPFFFFFFFFFFWPPPPGGYPHFS 
FLKKKKKRGGGGGKSLLPPGKGPNPPK\ 
WGFPLFPPPLFPQKTPPPPFPFYKTPPI 
PPQGPRGGPLKFTPPGGGGGNPPTIIjLD 
KKRGVLGPPPFFWTJSr 


2110 


16011 


A 


2125 


3 


439 


MFDVSLLTPTHSLFHFSPQPHRKCELST 
LCDGGELRDHILLPTSICPITR/DKCSC 
PGEGC 


2111 


16012 


A 


2126 


1 


213 


HFPVENESAPG/ FKAAGDLLTLIiLGGNA 
AGDFKLKALLVYPSENPCFLKGS FKPNIi 
PLVWCSHKKAWVQLG 


2112 


16013 


A 


2127 


104 


419 


NSFPFEEELYNPFGGLGKKTFPGGGEEP 
GHTPPBNEALGGKNKFFTGBGGQTPSNN 
GEEKSVSFWISIEKILHRALL/AHAIiCK 
NCWELNFGQKEEPPPPPPEEF 


2113 


16014 


A 


2128 


399 


140 


PPPPKNFFFPPKGKFFG/RG/VGPKFPP 
PKKRVFSQKPPRGFFYPPLiKKKKnSTPPPP 
PGNFGPPRGFFKRPPPFFFPPFPPPFFF 
FFFPF 


2114 


16015 


A 


2129 


10 


457 


KTS WTWCPVPWPATQKAB / AGGS PEPG 
RSRLL 


SEQ ID NO: of nucleotide sequence


SEQ ID NO: of peptide sequence


Method


SEQ ID NO: in USSN 09/515,126


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence


Amino acid sequence (A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine, K=Lysine, L=Leucine, M=Methionine, N=Asparagine, P=Proline, Q=Glutamine, R=Arginine, S=Serine, T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop codon, /=possible nucleotide deletion, \=possible nucleotide insertion)
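

The legend in the column heading can be applied mechanically to any entry in the amino acid sequence column. The following is a minimal sketch, not part of the original listing: the function name summarize_peptide and the result keys are hypothetical, and it assumes that whitespace inside an entry comes only from line wrapping. Any character outside the legend is counted as "unrecognized", which is one way to flag text-extraction damage.

```python
# Symbols are taken from the table legend; all names here are illustrative.
AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")
SPECIAL = {
    "X": "unknown",
    "*": "stop",
    "/": "possible_deletion",   # possible nucleotide deletion
    "\\": "possible_insertion",  # possible nucleotide insertion
}


def summarize_peptide(raw: str) -> dict:
    """Count residues and legend markers in one amino acid sequence entry."""
    seq = "".join(raw.split())  # drop whitespace introduced by line wrapping
    counts = {
        "residues": 0,
        "unknown": 0,
        "stop": 0,
        "possible_deletion": 0,
        "possible_insertion": 0,
        "unrecognized": 0,
    }
    for ch in seq:
        if ch in AMINO_ACIDS:
            counts["residues"] += 1
        elif ch in SPECIAL:
            counts[SPECIAL[ch]] += 1
        else:
            counts["unrecognized"] += 1  # character not defined by the legend
    return counts


if __name__ == "__main__":
    # Entry for peptide SEQ ID NO: 16021 as printed in the table.
    summary = summarize_peptide("THTHHTH/ THTHTHTHTHTSLS")
    print(summary["residues"], summary["possible_deletion"])  # prints: 21 1
```

A non-zero "unrecognized" count for an entry suggests that the printed sequence should be checked against the original publication before it is used.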


2115 


16016 


A 


2130 


409 


0 


PPPPSPPSPPPPPPS/PSPSPPPPPPSP 
PSSPPPSPPPPSPPPP/PPPPPPSPPPS 
PPPSPSPPPPPSSPPPSPPPSPPPSSSS 
SSPPPL 


2116 


16017 


A 


2131 


317 


177 


FFFFFFFFFFFFFPFFFFFFLRLQI FFF 
FLPILIFLFLPFLFFFLFF 


2117 


16018 


A 


2132 


3 


542 


EPWS VAQAGVQWRDLS PLQPLiPPRFERF 
SRLSLPSSWDYRRLPAHPANF\QFLVET 
VFRHVGQAGLELLTSGHLPALTSQSAGI 
T6MSRTS\RPGFLFKV 


2118 


16019 


A 


2133 


146 


439 


LKSVTSIAKTWIQPK/CPTNSEVDKCPS 
TVKWIKKMWYI /YFTMEYYAAIK 


2119


16020 


A 


2134 


95 


1 


GRVDAIIiAHCKLR/IiPGFTIiFSCLSIiPS 
SWDC 


2120


16021 


A 


2135 


2 


380 


THTHHTH/ THTHTHTHTHTSLS 


2121 


16022 


A 


2136 


4 


337 


iCRNNKAWMTVHLFTAWFPEYFRPTVETY 
CSEKKIPFKIL/LLVDNAPGQPRVLVEM 
HKEMNWFRPANTASILQPMNQGGISTF 
NSYYLRNTFHKAIVAIDSNSSDGFGQNK 


2122 


16023 


A 


2137 


2 


356 


PVSSSQVRASVYLKKKKKKGPPPEGITN 
TAGEKPPQSFRGK6\PPFPLISPKKEPV 
I\SFLKNFGPCTIGKKKPPHPPAKNGGP 
LQ/RPPTQGGGRGKKKGKNNQRPLSGLG 
GNRSPKPPF 


2123 


16024 


A 


2138 


2 


357 


FLGSSDPPPSASPVGRATG/RVFFFFFL 
VEWSHYVAQNGLELLDTSNPPAVASQS 
VRITCVSHRTWIiLSPLYKIIQVCVSK/S 
PELEQSEDKSLKK 


2124 


16025 


A 


2139 


379 


16 


LLQVRCFVSTVNRGSSCQKTIQVYYVQE 
AIPPSFLLSPFLM/PYTKINSRWIKDSN 
VKPKTIKXTLEENLGMPLNI FFTYQIiLW 
LiYLHPESQLEICNSPRALQEGNLIIFFI
GRVGRPGTTGL 


2125 


16026 


A 


2140 


73 


411 


NYLLlSTNIiFFFFFLERGLTFAPRAGGWGG 
NLTSWNLGPPGPNKPPPFPPKRP\GTPK 
PTLKEGIiFGPFKTTGFPPGAQKGPELPG 
LRGPPGIiAPPRGGNKGGNPPPGPLKTFiT 
G 


2126 


16027 


A 


2141 


46 


421 


AGVSWRDHSSLQPCLTSRARAVXSHLSF
LSSRDYGPMPPPPKKKKKKKKKKKKKKK
KKKKKKKKKKKKKKKKKKQ


2127 


16028 


A 


2142 


26 


479 


LAYLLPSPVMYKPQNWRTILQSISNKGL 
I CRI YKETLQLiraKKTNNPI FFKWVKKS 
KHFTRGAIQVANKXHIKMKIiNI I SVYGN 
AQKNTMRYHHTP I KMKKMGCALWLTP VM 
CQBAGYILKMYPGWTPYPGRWGRPTAP 
ALPPRGTFPRD 


2128 


16029 


A 


2143 


9 


166 


QNRLIARIiTKKKRGKNQAHP/ IKNDKGD 
ITADPTEIQASIREYHKH


2129 


16030 


A 


2144 


1 


328 


LEKESRPPPPPPPPPPPPPSPP/PHPGS 
LPKPLGLQPLPRPEVSFLTEYLPKLKAC 
EGGGVE I AAAS FPRI YLMGMCSRQDRI Q 
KDIDWIQKSRAEDCLFAGVKAILKA 


2130 


16031 


A 


2145 


1 


440 


KTFLRSLWQLVE/CYISSGLIDDHRRPM 
ALTPRHPRSGITAPRPRPQPPGRVGIPE 
PTALSPSPGPPPPPCSTPGRCQVPSLER 
RRKEGREPPSVGRGCGGHGISPSSDIFF 
HLNFCliAHPPDLFVFVCSlSNQLYIFHE
KKKKKGGR 


2131 


16032 


A 


2146 


23 


422 


iasgrpffffwgap6geegkknffwgpr 
ggkkkirc3aptprgkgnpppkpwgkkgg 
appknlglfgkkgaqrgppgwfktpgkk 
kspppgpkkgkknrgrpXgpppqiffkp 
ffstgeknplgkktpnwrllpw 


2132 


16033 


A 


2147 


3 


372 


kkntpffltprgppppprgplspppkra 
tpppppfkkkkpgfppkkknffpppggg 

PPPP /IiPGEKKGGYF/WW 


2133 


16034 


A 


2148 


363 


1 


FNCRWFETKSHSVPQAGEQWHSFGPLQP 
LPPGFK\SFSCLSIi 


2134 


16035 


A 


2149 


2 


361 


FFFLVETGFHQVGQGGLEHLT3DDPPTF 

GLP\GGWDYR 


2135 


16036 


A 


2150 


387 


2 


KKSFFLVSPARVQWGDPFNPPIPGSNNP 
PFSTPPKTGVIRGPPPARKIFVFFIKTG 
\FPQLGRGVLKSLPQ/CDSPPPAPPKGG 
VSGGNPPAPPFFFFFFEMESHSVSQVGV 
QWRNLGSLQPPSPGFKQF 


2136 


16037 


A 


2151 


23 


421 


IASGRPFFFFFFFFFFFFLGGRGVFFYP 
PGGGGGPNFFKKTPFPPGKKKFFPPPSP 
KGGFLTPPPPPPFFFFFLKKKGVINGGG 
GGVKISPPGGTPPFLPQKGG/NKKGGPP 
PPRKKKIFIiFFFPGAIKQRPPP 


2137 


16038 


A 


2152 


3 


366 


viwahcifhllgytdrpclkkkkkgfpg 
gekmglknffpplgkkppppqipnfwsv 
iekpprgfylggaappkkffllkp\gpp 
ffkeppppkkkggaspplkkkffkgkks 
ppfffsdpts 


2138 


16039 


A 


2153 


423 


253 


FPWGGGAP IFPPPKKGFFPKNPPGVFFS 
PPKKKKIFFFP/HPVNFGPPKDFFKRPP 

p 


2139 


16040 


A 


2154 


287 


1 


KEMNKKKNPKKKKVSKKISTAFYNKTLN 
NINFF\FAFFFFFFFFFFIFFFFFFFFF 
ppj- p p F p p p PF PFp- FFRPLFLFjjYKQTA 
QLMQHSAPTRP 


2140 


16041 


A 


2155 


433 


2 


RRVLFIFPPL/HPFSPPPIiPFSPFPPPP 
PMKFFSPPPTPPYFYKSPPHPP/HPPQV 
VFSPPTPPMPFITPPPPPHTSPSFPSFF 
FFFPFFFFFFFPSFFFFFFFFFFFFFFF 
FFFFFFFFFFFFHIISIKGEIGRSSWR 
VMSRTRG 


2141 


16042 


A 


2156 


69 


606 


LWWPPLSRHAAYRQWPGPTAPRGLGHKV 
KGPGASPA\PCGAAAGSTAQG\GGGAAC 
LPGPAAGAVTWPAGPGGGRATGPVLQR 
PAGAGQPTQQGQHDHAGRVLWQHGIGRP 
GAHRPGPCAP/DSAPRAPGGRCG 


2142 


16043 


A 


2157 


449 


0 


SSPGSSSSCSPPPPGGGVGFFL/YKPPQ 

kkpppppvggpgffpqkiclksffplppp 
fplggggpppppppkiksfypppkkvsf 
ppppkkafiipppppplpppppsppqppp 

liF 


2143 


16044 


A 


2158 


241 


5 


kkfsffppggsqggffgslqnpppgftp 
ffclnlgkkwgqrgppprpnnf/sffff 
fffflvetgfhhisqdgldllts 


2144 


16045 


A 


2159 


434 


2 


lgsasqlgcsgvrdpleeavcpfsdlql 

CARRTTALFKAVRQGHLSLQRLLLiSF\V 
CLCPVPRSGAYRGRQASIiSCGGLHPVQA 
SRLLCLPKQAWAMAGAP/PPASLQPCSL 
I SDC CASNQRDS VGVGPSEPRAGYNLIiV 
RCFLSP 


2145 


16046 


A 


2160 


448 


84 


PLGGFFFFFFWFPPPSPPQKEAi\QKKI 
FFVFFSPESNQKNFFFSFSERGGPPNPIi 
FKGGGR\FSPWPLFFFFFFWFFFIiWFF 
PFFFFFFFFFPFFFFFFFFFFFFFFFFF 
FFLGKCPLKyT 


2146 


16047 


A 


2161 


3 


156 


RGCSEPRLRHCTPAWVT\KETVSQBCKKK 
KFIiTI»GGKTFKNFFFFNRAPGTFP 


2147 


16048 


A 


2162 


412 


37 


PFHSPPPPPPPAGGAVFPPKKKKKSPPP 
PTPLLL/RGGGGPPPPPPPKRGPPPQKP 
KRGFFSPPKKKKKFFPGPGGPPGPPQKT 
PPPPPPPFFFFFFFFFFFFFFFFSLFFF 

PLRTLWRKGRGRGI 


2148 


16049 


A 


2163 


5 


370 


Q S S C VQ WL VP WPT I QE AEAGGFLEPRS 
ERLQ/CSHCTPAWA 


2149 


16050 


A 


2164 


437 


36 


FFFFFSEAESCSVAQAGVQWHARSLPQP 
LPPG\SSDSCLSLL.SSYMCLSPRTRGS 


2150 


16051 


A 


2165 


3 


623 


RQGFTLVAWAGVQWYDLGSLQPPPTGFK 
RFSCLSLPSSWNYRHAPPCPANFVFLEE 
TGFLHVGQAGLELPTSGDLPASASQSAG 

itgmshca/rpkvcsyhlff 


2151 


16052 


A 


2166 


337 


0 


PPPPPPSPPPPS/PSPPPPPPSPPS\PS 
PPP 


2152 


16053 


A 


2167 


350 


156 


AE ATS KI RCQKC Y I M I AGHS GTRLQFQL 
LQRLRQENHIiKPRA/RGCNEPRSHHRTW 
PTWYISKSFIiA 


2153 


16054 


A 


2168 


1 


337 


DERSLSQRSRSWSYNGYYSDLSTARHSG 
HHKKRTKKKK/ 1 KKKKKKKKRGP FKKKG 
PLKTRKKPRGGLLKAHPFWGGPPP6FPL 
TGEGAPPPVFFKKKKKKPPIiGGKGGFLW 
G 


2154 


16055 


A 


2169 


2 


606 


RVLRAVAAHEEPDKEGKEKPHAGVS PRG 
VKRQRRS S S GGSQEKRGRPS QE P PIiAP P 
HRRRSQPP\HPGPLPPTNAAPTVPGPVE 
PLLLPPPPPPSLAPAGPAVAAPLPAPST 
SALFTFSPLTVSAAGPKHKGHKERHKHH 
HH\PAPMVIPAPAEPI 


2155 


16056 


A 


2170 


311 


2 


GLQPLGLGSVKQCMDLACVPETVCVCVC 
VCVCARAHTPVCTQGCVPESTQCVCMHV 
C/VCVWACTCVPVCMHTCVGVGASVCMQ 
RNELGRWAWENDAIRQQRC 


2156 


16057 


A 


2171 


78 


337 


NTPNSIYKAMSLKGPITGXFLPNYPGHK 
VCVCDTYIiCVYQHTHTHTHTHTHTQAFP 
HI/HIYTHT 


2157 


16058 


A 


2172 


2 


278 


KNRLNPGGGGCSKLRSCHGTIAWAT\SE 
TPSQKKKEKRKRKGRKRKRKKKIVYTKI 
EKGNAKMTGESFIHFCKKSSSVPQVALS 

AEYRCSST 


2158 


16059 


A 


2173 


2 


365 


FYHVGQAGLKLLTSSDPPALGLPKCWDY 
RRE\DCAQPNVKS 


2159 


16060 


A 


2174 


195 


389 


FRVFSKIiKYYYDFFRGRISLSCPGWSTT 
PGFKRLSCLSLPSSWDYRRPPPCPAN\F 
CIFSRDGVSP 


2160 


16061 


A 


2175 


23 


374 


lASGRPLIFFFFFFFFFFLFCRGGGPPF 
NPRGKERGGFLIKGPPPPGGKKNFWAPP 
PGGGEIKRTPPPPGVFFFFLKKKGFSFG 
GGGGKK/PPPPGEPPPPPPQKGGKKKKK 
TRGPLF 


2161 


16062 


A 


2176 


2 


158 


FF FL KQ S LS VTQ AG VQW \ P VD S LQPL P P 
RFKRFSCLSLPSSWDYRCVPQCPAK 


2162 


16063 


A 


2177 


340 


120 


PRFHFLASPSEMSQMTVKAKTTVPASEC 
AYPKIEPFFPF\BPRGSES\FDKLGVHH 
HPLFPLSGEPLMILHKNTH 


2163 


16064 


A 


2178 


3 


256 


YLKKDIiNVNQKTLTLLE/EKQGK/HLHD 
MRLCKEFLNKTSKAQYILKKISQY/LIK 
VQNFNAVKDPVKRMKRQASD/WENIFT 


2164 


16065 


A 


2179 


95 


1 


GRVDAIIjAHCKLR/LPGFTLFSC3:.SLPS 
SWDC 


2165 


16066 


A 


2180 


1 


143 


ARGERERERERERERBRERERBRERERE 
RERVFV/RERAALSRERGDKCERSGLF 


2166 


16067 


A 


2181 


1 


304 


ARGQGHLSLQRLLLSFWCLCPGPTGGA 
YRGRQASLSCGGLHPVRASWLLCLPNQA 
WAIAGAPPAALLPPCSLISDCCASNQRD 
SVGVGPSKPCVGYNLLVW 


2167 


16068 


A 


2182 


3 


163 


FQRRS TESCGWDKDARS QSRS PPRQQAH 
GHHS /HTHTHTHAHTHTHAHTHAWTRP 


2168 


16069 


A 


2183 


3 


205 


NFKLFCKGFFFLRQSL/DSVSQAGVQRH 
NLGSLQPLPAGFQLFLQPPPPGSKDTHP 
GVQRHDHNSLQP 


2169 


16070 


A 


2184 


228 


88 


WFFRIYDSFFLVFLFVFFFFFFFFFFFF 
FFCVFTPFPVYF I QYWYI 


2170 


16071 


A 


2185 


287 


3 


SLFIiAIPQWEFQRKIYLRKQKIVLIiACQ 
PS KGL S I VQWRGQLNS P VLKS QDPTP \ 
PRFKRFLCINFLSRWSYRHVPPRPDSFV 
FLVETGFLHLV 


2171 


16072 


A 


2186 


252 


105 


VSISCHPQ\PCLIYTCY/HIDGYSISriHL 
CTHTHTHTHTHTHTRVELQGPA 


2172 


16073 


A 


2187 


267 


459 


KHVPGSS YTWL/ 1 FFFRGNFLKKGANFV 
PQGVLKGGNLSSLYPPPPRLKQYSCLTL 
IiRIGNYRHA 


2173 


16074 


A 


2188 


170 


15 


RSVMKDUSrSHFSKENIQMANKRTKECSA 
LWFREIK/TTMRCHIiTPIRMATIKK 


2174 


16075 


A 


2189 


2 


504 


DVTISTCHASAKVGTRLVFDHYGKIIQK 
TPYPHPRGMTVSVKQLFSTLPVRHKEFQ 
RNIKKGLG\RRSCFCFDF 


2175 


16076 


A 


2190 


1 


565 


FPETESLSPRLECSGAISAHCNLHLPGS 
SDSPASASRVAGTTGAGHHTWV/ IFFVF 
PVETRFHHVSQDGLDPL/NLVIRPPRPP 
KVtiG 


2176 


16077 


A 


2191 


142 


382 


NTPPLIiFFFVIRDRVSLCCPRWSGVAQF 
WLSATYASRV\ KRFSCLSLP /SNWDYRC 
VP 


2177 


16078 


A 


2192 


138 


365 


KHQYHHCC/LLKKKKKKKKKKKKKKKKK 
KKKKl KNfKKHGGPFiCKNFW/EGQPRNWA 

GGV 


2178 


16079 


A 


2193 


385 


21 


RGEIFFFKTRRKKFSPQGGRGGVFPPSP 
PKNFFFPQGGKFFGG/EGGPKVPPPKKG 
GFPKKPQGGFKVPPKKKKKIISPPGVIG 
GPPGNFLKGAPPFFFFFFFFFFFFFFFF 
FFFLVRAVKLS 


2179 


16080 


A 


2194 


415 


56 


PPPPTTAPVFSPPPPPRPFFSPPPPVFF 
FRSFPPAPPPPPLFFPPPSPP\PSFPPP 
PPPTRPCPPPPPTIPPPPPPFSPPPPPF 
FFFFFFFFFSLLPFFFFFFFFFFFFSPL 
FLFFPSRLW 


2180 


16081 


A 


2195 


35 


369 


MKRPSPPPPTPPPPPPPPHPLLPPFPEK 
KTGPP/RGFFKGGERGPPPPKKKKDPPP 
QKQKKKKKWGGGGGGGGQRKKKHPQKNP 
PPPPGRGGGKNFLWGRPRGPPPQGGGGK 
K 


2181 


16082 


A 


2196 


2 


266 


SKPRSCHCTPAWATQ\ETPSYKKKKKKK 
KGLPLGGPGEKT I FFSLKPGQKAQPKNK 

IPFKKKIPFFSTPKGKKKKNFQGGPKKK 
KKEI 


2182 


16083 


A 


2197 


1 


277 


ARGERERERERERERERERERERERERE 
RERERRHP /RPRERE 


2183 


16084 


A 


2198 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERERERERERBRERERGGAPP 
PRLYRERVLKKQRGRPLPPVRERKK/RS 
PVQTLPGY 


2184 


16085 


A 


2199 


346 


1 


VPRHGLPCFCFPHQLQPPKRKKFNNRKR 
ALTSHTQFFVFETESCSVA/RECSGAIS 
AHCHL\ CIMRSSNSPASGGAGSSELRFR 
HCTPVGQQTE I WS PKNKP I KKEEGCPYC 
QFLSC 


2185 


16086 


A 


2200 


3 


389 


HEGMIIiAHCSLNLPGSGD/ASHVARTTC 
VHHHVQLILFIFVETK/FSHYAVQAG/L 
ELLSSSDP/PTSQSSRITGMSHH 


2186 


16087 


A 


2201 


3 


204 


HECHCTPAWAT\SETLSQKKKKKKKKKK 
IFPGEKTWGGWNKKKFLFLNRKPFLGKG 
liFHLKRGPLKLF 


2187 


16088 


A 


2202 


69 


379 


KKKRVFFWGPGGSKPPT/SGNPPPWPPK 
GGGLRGLPPPPGPRGVIFKNFGKRGPPP 
PPGLKSWGPRDFFGI^QRGGISGLNNG 

APPFFFGVIiKKLPPLiFFVSKG 


2188 


16089 


A 


2203 


81 


379 


KKKKKPPPPKKKNPGPKKQEGGPLRGPP 
TFSGGGGGKKTGPQKLKAAGG/SKKAPG 
PPPGAEKKNPALGGEKKTLGPNGRGPKR 
liRGLAKKGNPPSSFGGK 


2189 


16090 


A 


2204 


1 


341 


GQSLPVSPRLQLSNGNRTLTLLSVTRND 
VGPYECEIQNPASAJSTFSDPVTLNVLCEY 
IiLF\PVAQATSPNPRSQRPGLSVPLSSK 
NADSPPTPRNPARHDFLPQANMGRPSLD 
QE 


2190 


16091 


A 


2205 


2 


370 


DAPPRPANFVFLVKTGFPPVGQAGFKLP 
PPGDPPPIjASQITGKG/HCAQPPF 


2191 


16092 


A 


2206 


362 


24 


PRGSSSASSSSSSSSSPPSSSSSSSSSS 
SSSPF\PPPPVLKPPPPPPKKSPPPKKK 
NPPPPKKKTFFPFFPFPFIiRRGLCPFGR 
GKRAKPPFQKKKKKRKRESSFYQQVIIH 
M 


2192 


16093 


A 


2207 


357 


3 


agqalwiiarvipvlwearsyyvriswyr 
nivamdsdsldgsghsklksfrkgfpvl 
yaikkthdswegvs\msaliwksliips 

CVDAFEEIiTSSAEEVAAQWGIATDLEL 
LVGCARA 


2193 


16094 


A 


2208 


280 


411 


gndvyflvflf/ clfekescsvaqagvq 
whdlsslqalpprrma 


2194 


16095 


A 


2209 


3 


392 


piiyekytqqinemprklqhqqlalvsr 
ngpillhdnaqlhitqc/lqklnelgyr 

VLPHIiPYSLDLSPTDYYFFKHLDNF\LQ 
GKHFHNQEDAENVSQKFVKSQSMDFYAT 
/GNKLISHWPNCVDCN 


2195 


16096 


A 


2210 


248 


417 


GIIiANfCASIKDIilSRIYKKLKQIN/KQK 
TNDPLKKWAKDMNRYFSQEE I QGVNKHM 
KK 


2196 


16097


A 


2211 


213 


493 


GKRFYPACPGKIGALQGFFKGGGPFFFF 
FFFLTPPPNTNYIiTNKLKIKSPLCTFYF 
SNILGFYPSITHRTIPDAWVTA/SQSGP 
VLMRSIS 


2197 


16098 


A 


2212 


159 


413 


liAPRVIFGPPKKPPERPPLFFFFFETES 
SSVTQAGVQWHNIjSS\PPRFKRFY/CLS 
LPSNRDYKRPPPHPANFCTPSRDGVS 


2198 


16099 


A 


2213 


455 


60 


NPRREVGPICPPPKIRVPPQNPQVGFYS 
PPLKEKTFTSPAPVNLGPPRDPFKRPPP 
FFFFFPLDFFFFYFTIYKTSAACPRSPP 
AQPRVARRPLVPSS \PPLPSLCLiAPAPR 
GPGSLCPRGSLEGDNGSSPG 


2199 


16100 


A 


2214 


2 


243 


LTLSPRLKCNGMVSAHCNFCLLSSSDS / 
RLR/QENCLNLGGGGCSELRLRHCTPAW 
TTERDS VS KTNKKlsrSLKI FTCFFVDAA 


2200 


16101 


A 


2215 


1 


286 


FSQLIiRLFIC/SQGGRLLCCESCPASFH 
PECLS IEMPEGCWNGNDCKAGKKLHYKQ 
I VWVKLGNYRQVFPRTKRKYS 1 1 VQTS F 
ILWIQSDIiEIDR 


2201 


16102 


A 


2216 


173 


2 


I T I FFF VRQGLTQAGVQWR \DLSGETLV 
ILPPLEAPCSLQSSWDYRRVLPHLPNFC 

IV 


2202 


16103 


A 


2217 


216 


4 


VYPPSFMVFSQVFLiSSTHISLSFSFFWN 
YLFIYLFIYLFIYLRWSL/DSVTQAGVK 
WHNLGSLQPLLPGFK 


2203 


16104 


A 


2218 


14 


228 


KRSSHLFTDDI ILYMENTKHSTK/NLLE 
LIKEISKVTG/YQKSVAFLYVWNKQAIK 
KTIPLTIASKRIKISrSGQA 


2204 


16105 


A 


2219 


244 


3 


EVLNQNSGLPRWPNILRAKASLRVPRQC 
SRGWFSANGAGTTRYLYAKEWGGLGGG 
GCSKLRSCHCTPVWAT\SETLSQKK 


2205 


16106 


A 


2220 


162 


2 


INMVGNLFFFGDRVSLCHAGWSAVAPSW 
PTVASTSL\VKQSSFLSLPSSWNHRH 


2206 


16107 


A 


2221 


146 


3 


GRVDGVPWRNPGSLQPPSP\GSSDPPTS 

ASQESGTTGAHHHTRLIFVF 


2207 


16108 


A 


2222 


239 


2 


SYISKPDKFPHDSSPEIKPVTVNWRNSF 
SFIFPPFFLFPEKESHSVTC3AGVQWRNL 
GVSSYWP\AGLKLIiTSGDPPAIiAS 


2208 


16109 


A 


2223 


2 


159 


LNRDLGGGGCSELRSCHCTPAWAT\SES 
PSQKKKRKKKKKKKKGGRGRNSKI 


2209 


16110 


A 


2224 


3 


345 


RFKLFSCIiSLPSSWDYRRVPPRPA/NFF 
VFLIET\6FAIt.TSSKTERQSRLECIFG 
FYGLPCREKRASERRSVEG/HERKILFS 
PDFFFLGGTESCFVTQAGVQGCYLGSPQ 
PPPPG 


2210 


16111 


A 


2225 


103 


319 


FSEE YRNVTNFLMLTMSCS CLTLVE / C / 
WSEGYMATPCTILLLLFFQRHCLTLSPG 

GVQWCSHSSLQPQTPGIK 


2211 


16112 


A 


2226 


2 


110 


FHHVGQAGLELLTQWSIHLSLPKCWDF\ 
RHEPPHPA 


2212 


16113 


A 


2227 


2 


178 


IFLIFTFIiEMSSHYVAQAGLEFPG/FKL 
TSRIiSIiLSSWDYRRPPPRLANFFAFIAK 

GDAA 


2213 


16114 


A 


2228 


173 


3 


FIFIFISLFFIFFLRQSIVLSPSAVQSR 
LQPPPPRFKQFS\CLSLLSSWDYKRVPP 
C 


2214 


16115 


A 


2229 


333 


1 


HTPPPPPTFHSLCTEGAPPPPQNT/PNP 
PRNPFI IPLSRSHTRANEPSLSTPRTHP 
HSPRPPLLTHPHPNPRASAPPGSRYPPR 
ARHRERPRERPTQRERERERESVCVCV 


2215 


16116 


A 


2230 


350 


3 


KKKDI PPVNIYAPNTGAHKYIKQILLDL 
NRDGP\HTITAGDFTTPLSVIiNSSAESR 
CSRPFIISPSLWGLRE 


2216 


16117 


A 


2231 


277 


1 


qmhpar6hlpqal i pvqkpali sqgms a 
sgstqvspflspcfwvevncsntkvltX 
pfcgagtgshsfaqagahwcnrgslqph 

PPGFKRIi 


2217 


16118 


A 


2232 


3 


335 


ETRFHHVGQAGLEPIiTSDDPPASASQSV 
/GITGVSLRARPGES /GSWKTVCCNNMS 

EPT 


2218 


16119 


A 


2233 


29 


448 


CPSLRQAWHEAAIDEVRTGTYRQLFHPE 
QLITGKEDAANNYARGHYTIGKEI IDLV 
LDRIRKLADQCTGLQ\GFLVFHSPGGGT 
GSGFTSLLMERLSVDYGKKSKLEFSIYP 
APQVFTAWEPYNFILTTHTTLEHSDCA 


2219 


16120 


A 


2234 


1 


365 


GARLILVFLEETGFHYVGQAGLELLTSS 
DPPASA/FPKCW 


2220 


16121 


A 


2235 


28 


460 


DRLIDHISKLGTRGLQGFLVFHSCGNLG 
\TGFGFTSIjLMERL.SLDYGKKAKLEFS I 
YPAPQVSTAGVEPYNSILTTHTTLEHSD 
CAFMVDNEAI YD I CRKNLD I ERPTYTNL 
NRVI S Q I VS S I TAS liRFDGALNVDLTE F 
QTNLV 


2221 


16122 


A 


2236 


614 


84 


LAAS LAACAQLSALAASHRMWALQRLRK 
LLTTEFGQS ININRLLGENDGETRALS F 
TGSALAALVKGLPEALQRQFEYEDP\ IV 
RGGKQLLH/SPHFFKVLVASRLVTLEAG 
HFCPCCAETHKW\AWFRRYCMASRVAVA 
LDKRTPLPRVFLDEVAAVRVCGHILQLG 
DTELQQHICHL 


2222 


16123 


A 


2237 


1 


393 


GPMIAQLSVFRCGS TSAPNDLWYHFIEL 
PYHGES ITMLIALPTESSTPLSAIIPHM 
STKTIDRWMSIMVPKKVQVILPKFTAVA 

QTDLKE P LKDLG I TD I VDS S QGHF CQ I T 

KZ\EKLLV/ SHILQKQK 


2223 


16124 


A 


2238 


3 


402 


HVGQAGVQIGKACWELYWLEHGIQPEGQ 
MPSDKTIGGGNDSFNTFPSETGAGKHVP 
RAVFVDLEPrVIDEGRTGTYRQLFHPEQ 
LITGKEDAANNYARGHYTIGKEI IDLVL 
\DRIRKIiA\DQCTGSQGFLGFP 


2224 


16125 


A 


2239 


2 


478 


GRGGLHRIPWTPLTPSFARGLVPSLAR 
GVEARNGAGPIKSYPRPGSRLKMQNGSK 
GSGLQNKTFHWEICDAHVNSKIQLKQ\H 
ISSRRHKDRVAGKPLKPKYRPYNKLQRS 
PSIIAAKLAFQKDMMNPIiAPAPLSSPLA 
AAEAVSSALTLPPRPFCFV 


2225 


16126 


A 


2240 


255 


2 


FLFVKPHQISCPTKKGIKSFLVLCPLNF 

FIiFWRQDIiAHPGWSAVTQSWLTAASN\Y 
GliKQSSYFSLLSSWDYRCIPPHLGGKRP 
L 


2226 


16127 


A 


2241 


365 


11 


EPPPPGGEKKKRGKPPPKKTPKKDGPQK 
KSAAFFGG\GKIKKKKGAFRKKKKKGGG 
PPPKKKPPPRKKKKKKNFSPLPPPKHTP 
SLFQKPPTKI FLFSPPLLFLNILFFFPIi 
SPFIF 


2227 


16128 


A 


2242 


3 


143 


PNFLSPRLDCRGTIRGHCGLYLPGSCDP 
S/CLSPPSSWNYRSTPP 


2228 


16129 


A 


2243 


132 


2 


MSAHCNLHLPGSSDSPASVSQVAGIMGA 
CHIF\VFLVQTKPHHI 


2229 


16130 


A 


2244 


334 


43 


TSKHMKRCS ISLATREMQNKTTIGGHFV 
PTRLANIQKPENAKYRQGCLMC /WCWVC 
NLASPLWKTIWHYLVKCLPYSSAIFTLG 
\ lYPEEVLASSVPGT 


2230 


16131 


A 


2245 


312 


2 


PPSKLPEKNFFFFFFFLRQSL/SSVAHS 
WQWRDLGPLKPPPPESKQFSCPTLLS I 
WETQTALSQDCATAIiHPGGQRETLSHKI 
CVCVCVCVCVCVCAVCIYIN 


2231 


16132 


A 


2246 


3 


230 


RAQAMVSTSRBRCLLRPPQIETR\HVAQ 
AGLKLLASSDPPTSASQSAGITGVSHHT 
WPQPLTFCPHAKSLPFINQI 


2232 


16133 


A 


2247 


2 


214 


GRVDIiVIQAGVQWHDLGSLHPPLP\GSS 
DLP/ASDSQVAGTAGRFHYAWIil FF\ VF 
FVETGS /HTQVSNS YDVL 


2233 


16134 


A 


2248 


3 


139 


EGVQGCHHISrSLQPPTPG\SSDPPTSAS I 
E AGTAGSHYHW7L I PLLF 


2234 


16135 


A 


2249 


3 


160 


EGVQGCHHSSLQPPTPG\SSDPPTSASI 
VAGTAGSHYHVWIilFLLFCFEGDAA 


2235 


16136 


A 


2250 


319 


3 


GQKRYKrFIiCFFLFGWLVFLRI^rSLTLSP 
RLERESVSKKPKSQKPPIKKPKFTPREF 
KEVLAKYGTKFFFVPFGLFVWFFLRWS L 
/NSVAQAGVQWRDPGSLQAS PRP 


2236 


16137 


A 


2251 


15 


394 


PVS FS FFPSFPSHLFFSSSS FPPFIjPS F 
HFSFLPSDRPSWPS/FLPSFLP 


2237 


16138 


A 


2252 


60 


306 


GRERILEEIMAEDFIiNMKDLNISrQVA 
QQIPSKMNSKRPHRYQHFRSQSQRILKA 
TREKQLATYKGSSVT/VSPSPGPQTVNS 


2238 


16139 


A 


2253 


3 


351 


GFHHVDQANLKLLTSSDLPASTSQSTGI 
TGI\DHCTQPNFPI 


2239 


16140 


A 


2254 


150 


2 


RPRRPDHSRLGAGDQPGQHSKTPS \HQK 
KTKTSQAWRHAPAIAGTRQAEA 


2240 


16141 


A 


2255 


357 


1 


LNXJSFLSLTIiYAKIlTLKWITGTlWKHKTT 
KFLGGKWGANLMDTRLDNAPLDLTSKAQ 
LTKEKIDKLjNFIKIQTF/CSIKDLLGNL 
KRQAPEEKKILRNHISNKELVSRXHKEV 
PKUNNKK 


2241 


16142 


A 


2256 


397 


1 


FSLFPPVGGQGGFFSSCKSPPPRFRAFF 
CPNPFRKGGNRGPPPHPGKGTLGPFFFF 
PRQS FALS PRVEYNLCLPGS SDP P AS AS 
QVGGAPGLPPPAWVNFGIFFF/GFFLRQ 
SRSVAQARMQWRHLGSLQAMP 


2242 


16143 


A 


2257 


2 


132 


TLTLTPRIiECSGTIS/AAHCKLHLiPGSR 
HSPASAPRVAGCGGGHIi 


2243 


16144 


A 


2258 


214 


347 


KISLILGVHKICCEF/CFFEMKSRSVTQ 
AGVQGHDLSSLQPPPPGL 


2244 


16145 


A 


2259 


347 


2 


FFSFFFFFSEAESRSVA\RLECSDTVSA 
HCTLHLPGS 


2245 


16146 


A 


2260 


333 


1 


SDQRWTENAFVELRDEGFACPSFSEL/R 
STPSTSGEEVENFEKKLDECITRITNTE 
KCLKELMELKAKAREPREECRSLRSRHN 
QLEERVSVMEDQMNEMKREGKFREKRT 


2246 


16147 


A 


2261 


2 


357 


SPRSCSVYGIAIIiLFI^YFLYKIiAPAIiLY 
GIALNSPLiHEIQEP/SIiASGSGPIjSRiTS 


2247 


16148 


A 


2262 


355 


0 


PPLPSPLSPPLSLSFFSLPPFPFFFPFF 

PLF/CPSFSFFFPSPFFP 


2248 


16149 


A 


2263 


348 


35 


YFIiKKGPSFFPRGEGRGKDSRSIiKQLTF 
GFKKP/SCPSFLRKW\ETRIiCPPAQKIP 
FPFFFLLEMGFCYIAQGDLKGSSQSSGI 
TGVSYHIWPTFIGHLTMCIiAKC 


2249 


16150 


A 


2264 


270 


92 


DRPARRKMPFTYSSRIiFFFFFFPFFFFF 
FFFFR\FFFFFFFFFKFFFFLFSRWSIiM 
IPIS 


2250 


16151 


A 


2265 


1 


417 


FRPPAGVQWRlTLISLQSLPPGFKRFLYIi 
SLPSS /WDYRCMPPQIiA/NFPVFWFDV 
WWWVLVEMRLRHVLARLDIWIiICI 
SLMANGMEYLFLCSFAIHLPSLLKCRFT 
YFAHCLFL.YYFLETGSHSVTETGWQWCI 
I 


2251 


16152 


A 


2266 


344 


427 


LIGLCEDTNLGAIHAKRVTLIAQDLQUi 


2252 


16153 


A 


2267 


310 


417 


RVPYGTLGSGPLATMAGFDDKLKPHMEG 

AAAPILVR 


2253 


16154 


A 


2268 


186 


464 


NSCLSHNQRQLLFRLEKGMGP I S AQEKL 
VLHTLSGFSGLVXVGWLVFEMESCSWQ 
AGVQWRVHPP PRFKQFSWLS FPS SWDYG 
HVPPCPANF 


2254 


16155 


A 


2269 


1 


427 


P EP PS LP PDGAKKQKTQKS KKWR I CFLR 
KKKKKKKRKKQKKKKKKRGGGSQKKSRR 
GPKLTTETNIIILIKGGLKKMNYREIEK 
KLLFGGGGVIGPTPTQDIKGREEINYLE 
AVGREKQRFISIiVK/TNVA/HEATRDTI 
FRGYL 


2255 


16156 


A 


2270 


399 


44 


isfqlllpitvlpghvrvlmemykevhv 
vfvpXstttfmlqsmgqgvtltfksyXy 
lrntfnkavaaltsds see sgq s qlkp v 
waafsslnaskni /rdsweavkipalkg 
vwkkwslkm 


2256 


16157 


A 


2271 


19 


420 


AAGIRHEERERERERERE/QRERERERE 
RERERERARRNIYHTYRPRPPRVPPFFP 
PPFFKKKRRGFLVPTPPGGGGAQKKKTS 
IiEGGKGVPFKGGGKKTPLKNPGGWGEPP 
QKKNGGGGPPPRDPPPPPRPLF 


2257 


16158 


A 


2272 


463 


20 


SYNIPLSQSLIQ/SRALTLFDSTKAERN 
RRGK\RSGKLQWEGSRGWLMRPKERSHL 
HNIKVQDEAVS YPEDLDKMDALNTKQQ I 
PSVHKIALYSKKMPSRIFIAVERESMPG 
FQASKDRL/LLLG/ATAAGDLKLKPML 1 
YNSKNPRVPRAEF 


2258 


16159 


A 


2273 


474 


82 


VGVWADFLKNTSQAQATKAKMDKWDPI K 
LKSFCTAKETISKAKRQPTEGEKIFANY 
TSDKGLI /SRIDKELKQPYRKNPNNPVL 
KWAKGWAQWLTPVIIiALWEAKAGRTQGQ 
E lEATLALPSGLFCQVFLC 


2259 


16160 


A 


2274 


152 


3 


AEGRNGDGIIQKSS I \RTLLSNDKNPQN 
IHRRPTRFLTMLYQQNLCHLGL 


2260 


16161 


A 


2275 


32 


361 


LGASARYEKPTVNLILNGERLNIFPIR/ 
SKTRLGYLLSLLLFNIGLAILASAINQK 
KEIQI IQIVKKKIKTQRKKKKNKTKKKK 
ALFKFKGGPEKEKGPPKNPFKTPPWF 


2261 


16162 


A 


2276 


329 


487 


E FVNI T I KI ATSLHYKAI VI KA/ AMVIW 
YWHKNRHIDQWKRIESIiEINPHICNK 


2262 


16163 


A 


2277 


2 


466 


AHHS CSHLLTTAAHYSCSPQLLTTTAAH 
TYSPPQLLTPAHHHSCSHLLTTTAAHHS 
CSPPQVLTAAHH\GAHTCSPQLLTPAHH 
SS PQLLTPAHHS CPHLLTTTAAHHSCSP 
PQLANSYRG\SPYCSSWSQTPGFRKSSH 
VGLPEPWIPHGLQW 


2263 


16164 


A 


2278 


4 


467 


knvtqqgkihirdkldeiviwgntsvpctn 
hmkhqtwfnakkcnvfkecgkXtacnfq 
ltqyqishanqkpyecqicgkpfrkrah 
ltqhnrihtggkpyeckecx3kvficcst 

lilQHKRTHTSEKPYECLECRKTFRRSAH 
LrRHQRIHTGEKPYK 


2264 


16165 


A 


2279 


383 


3 


FIiCVCACVCVYVCMGMCACAC\CMCMCG 
CVCVCVCVCVGLRGLGWGAWCRSWGPP 
LCFLLLGIIiPLKSRIiIiWriPRRTI S ICTL 
PSAQGPLPAPGFGKYASNTTGVKGSSSV 
FSLLSRITALSHLHW 


2265 


16166 


A 


2280 


47 


219 


VCSELKS CHCTPAWAT \SVTI.SQKQNKT 
KQRRTLGS I FPQHTFMHLKKEKSL ILQK 
W 


2266 


16167 


A 


2281 


294 


160 


NKTFFFFFFFFFFFFFPFFFFFFFFFFF 
FFFFFFFFFFFFFLFLF 


2267 


16168 


A 


2282 


68 


490 


RTFAYPKPPVFGSHSGKTTIPKNARGSF 
PS/PPLPFPCPSPSPGPSPVSCLPPRLP 
SPSPSPSIiSPISRLPSP/SRSPSPSLS 


2268 


16169 


A 


2283 


3 


454 


CQSAPLGGASQLGYSGVRDTLEEAVCPF 
SDIWLCAGRTTTLFKAVRQGHLSLQRFR 
LPFVWLCPAPRGGVYRGRQASLSCGGLH 
QVRASRPIiCLPTQASAMAGAPPPASLPP 
CSIjISDCCASNERGSTGMGP/SEPGTG 


2269 


16170 


A 


2285 


1 


452 


LKDSGRDYVSQFEGCALGKQLNLKLLDN 
WDSETSTFSKLREQLGPVTQEFWDNLEK 
DTEGLRQEMSKDLEEEKAKVQPYLDDFQ 
KKWQEEMELYRQKEEPLRAELQEGARQK 
LHEL/QEELS P/LGQEMLDRARAHVDAL 
RTHMAPYSDELRQ 


2270 


16171 


A 


2286 


3 


266 


NSSPPSSGHSTPRLAPPSPAREGTDKAV 
SALKSPQPNRGMGRGQR/PGLPS/DTAT 
ITPHTSGFPKQPQLSLKVQTQRAKGRLS 
HWDLEP 


2271 


16172 


A 


2287 


274 


453 


lYTFVKSSSKTLRPRHIDQWNRIENPEI 
KP / EYSQL I FDKANKNIKWEKDTLLNKW 
CWDN 


2272 


16173 


A 


2288 


81 


487 


TVYFKPTVETY\CWDKKKIPFKGLLRRD 
NTPCYPKSLLEMCEKINIVFAPATTTSS 
/LKPMDQGVIVTFKSNYLKNTF\RLGEG 
RKKKKKKGRKKSFHKALAAINSDSSDGA 
GQS KQKT / FWKGFS I PNAIKNI QDPWE 


2273 


16174 


A 


2289 


75 


469 


SRGVAGAPPKPSPSTPSPGPLDVTPGPH 
SSHHAASPGP\PPPPEPTASSMASAP\P 
PAPQPTPL\PPATLGPPSAGPE/PSPGS 
CTSTGWGYSFCCPRCRRWMRWQPPQQGP 
AWHWWPREFP/ PPPRVSGS 


2274 


16175 


A 


2290 


256 


55 


PTQPLRVELTTVTLRCDINKWDYIKIiKS 

fctaXkkrqpterekifpnhvsnkrlxs 

KIYKELIHRI 


2275 


16176 


A 


2291 


497 


29 


slthrvaggaavtpiahagarqifflgd 
phptssiilgwgpawdpcafqvsdhpass 
RVGKTLLSCPGGGRLPWVTCPGGGQLSC 
LTQMHLTPAGMCSPCVCVCVCVCVCVPN 
GMCPRCMCVCVCPCRDVY/ ICVCVCVCV 
CVCMCLCPCWDVFTEIi 
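
The column legend for this table defines a one-letter residue alphabet plus two frameshift markers ("/" for a possible nucleotide deletion, "\" for a possible nucleotide insertion). The following is a minimal sketch, not part of the original filing, of how one sequence cell could be tokenized under that legend. The names `parse_peptide`, `ParsedPeptide`, and `span_length` are hypothetical, and the choice to treat lowercase letters, digits, and other stray characters as unrecognized (rather than guessing the intended residue) is an assumption made because this text is OCR output.

```python
from dataclasses import dataclass

# One-letter codes listed in the table legend, plus X (unknown) and * (stop).
RESIDUES = set("ACDEFGHIKLMNPQRSTVWYX*")


@dataclass
class ParsedPeptide:
    residues: str       # recognized one-letter codes, in order
    deletions: int      # count of "/" markers (possible nucleotide deletion)
    insertions: int     # count of "\" markers (possible nucleotide insertion)
    unrecognized: str   # characters outside the legend, kept for manual review


def parse_peptide(cell: str) -> ParsedPeptide:
    """Tokenize one amino-acid-sequence cell according to the legend."""
    residues, unrecognized = [], []
    deletions = insertions = 0
    for ch in cell:
        if ch.isspace():
            continue  # line wraps and OCR spacing carry no meaning
        if ch == "/":
            deletions += 1
        elif ch == "\\":
            insertions += 1
        elif ch in RESIDUES:
            residues.append(ch)
        else:
            unrecognized.append(ch)  # assumption: do not guess at OCR damage
    return ParsedPeptide("".join(residues), deletions, insertions,
                         "".join(unrecognized))


def span_length(begin: int, end: int) -> int:
    """Nucleotide span covered by a row; begin may exceed end in the table."""
    return abs(end - begin) + 1


if __name__ == "__main__":
    parsed = parse_peptide("LHEL/QEELS P/LGQEMLDRARAHVDAL")
    print(parsed.residues, parsed.deletions, parsed.insertions)
    print(span_length(497, 29))
```

Rows whose `unrecognized` field is non-empty are candidates for checking against the original page image rather than for automatic correction.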


2276 


16177 


A 


2292 


141 


392 


SCSTEVKQPKIGVREVDPVAAPGIAPSS 
PQKKQTTSTSSLKSIiLRLSKVKFIiENIV 
LDVLARE IRQEKE I KGIQ /LGKEE 


2277 


16178 


A 


2293 


14 


291 


YVGTEGXSFISFMRVTNYMTRHIiATLRE 
S\CYSR\VyPRFIEFLRFDIQSTGQ/RI 
TSRQHPPR/DLRDALIiYLNRRI TLVRTR 
CKSVAKRPPGSP 


2278 


16179 


A 


2294 


110 


293 


LLSNRSLAIMLCGGCNELRSCHOTPA\ 
WRQRETVKKKKKKKKKPGWGFLNPPPQR 
GKLCF 


2279 


16180 


A 


2295 


96 


313 


WNGCYLLSNRSLAI ASLCGGCNEIjRS CH 
WTPA\WRQRETVKKiCKKKRKKPGWGFLN 
PTPQREKLGPLKRGPGF 


2280 


16181 


A 


2296 


234 


2 


CCLETTRSIiFDKGTKENTQWGKDSPSNK 
RCWKNW I S TCKRMKLiDP YL I P / YTNINV 
KY/ 1 KDLNLRPE I TKLLEENLREK 


2281 


16182 


A 


2297 


334 


6 


KLFSPPGGGGGPPPPPPKKGWVPPETPX 
KRGGGAPNPPPPKGGGAPKPPPQKKNS P 
PKKKKKIFCPPPKKKGPKGVFLRGPRGP 
YRVFLKGPPLFFFFLKKSWRPLAMYA 


2282 


16183 


A 


2298 


467 


8 


LPGFKASTDRDNWAGDFKLKQMLIYHS 
ESSSALKNDTKSTLPVLYRNKEAW/VTA 
HnLIPWCTEYFKLIAETCCSERKrSFKI 
LLLI \DNAPSYPRALMKMFKINVFMSDN 
TTS IVYSTDQGVILTCNSYYLRNTFYKA 
ITAIDSDSCRMPQEGN 


2283 


16184 


A 


2299 


1 


449 


S I YLS IP/ FNLS INLS I YLS I P I YLS I Y 
/HISVYI YL/S I YLSSIHLSAI I CLS I F 
QSISLSLSINLSTYLSTIHLSLSSIIYS 
S 1 1 YHLS IH/LS YYIjS 


2284 


16185 


A 


2300 


1 


445 


QAGLQLIjTSGDPRTSGLPQCWDYRC\DH 
RSWQT 


2285 


16186 


A 


2301 


241 


22 


KWVLGTCACVHVF I CVYMCLYVCAW/ C 
VSMCIiYVGACVCMNQC/VTVCMCVCVCV 
CRFVCVGIPPHSKWISIG 


2286 


16187 


A 


2302 


3 


478 


GGQTETLIjTSQRKGGWPEALLTSQMGRP 
GRGAPHI PDDEQPGRDAPHLPDGAAGQ \ 
SAPHLPDGEXPGRGAPHIPDGAAGQRRS 

plprwgg/elgrgaphipdgaagqrrs s 
hpr/pgrpgrgaphfpdgatg/q/dgap 
hfpgippdgt 


2287 


16188 


A 


2303 


440 


41 


KSHHLS FLS FLLFFFP / TKSHS VAQ/ CW 
SAISAHCNIiCLPGS SHSSALASRVAVTT 
GVYRI 


2288 


16189 


A 


2304 


2 


395 


FFLVKTRFLHVGQAGLKLPTSGGPPALA 
SQ/SL/RXTGMSHRTQPE 


2289 


16190 


A 


2305 


184 


2 


S IKKPKPLVmjRNH/NT/ WQGAVTPACN 
SSTLGGRGGRITRSGVQDQSGQRRSSEF 
LRLGYGLD 


2290 


16191 


A 


2306 


86 


472 


I IKLCKWQNK/RFEAGSLVPEMGFYYVT 
QAGLELIiFSRDPHTSASQSAGITGAAFK 
QRWSVGTVLLQVDRGTPPVGDCGSRTPQ 
WPGQAFLRTALKSEAHPPHSSTDVTPVL 
WSEGSPCLIiSPPSLSFTG 


2291 


16192 


A 


2307 


133 


2 


RVIHVWRSHPAI PXTTREAEAGESPEP 
GGGGCNEPRSCHCTPAW 


2292 


16193 


A 


2308 


3 


369 


LTTAFPDFPGSTHPLTSASQVAWATGAH 
HHSWLXFVFFVETGFHL/SELLSSSSPL 
ASASQSAGITG/REPPCLTSPFPF 


2293 


16194 


A 


2309 


253 


365 


TDYFYLFIYIiRQSIi/SSVAQAGVQWQDIj 
GSLQPPPSGFK 


2294 


16195 


A 


2310 


362 


0 


TKAQQCVHENHFKIiKDANTLNIKVWRKTI 
CHSSPNQKKYGLAILNLDKSGFRSRKDT 

GDEE/HFIKIKKSVIQEAIIIINIYA 


2295 


16196 


A 


2311 


167 


2 


LCYCVIFVFFIFWRQSLTIiSQGGMQWCD 
HGSLQPLPP\GLKRSSHLSLPNSWDYRR 


2296 


16197 


A 


2312 


67 


482 


DHLIPGGGGCSELQSHPCTPAWVTA/ET 
LSQKKKKKREK 


2297 


16198 


A 


2313 


188 


437 


AGFFPPENQLTNMKVRKANESDPWGVKP 
ESDESETHGSLSLSLSLFFLFIFLRQSL 
/NS V I QAGVQ WRNLiS S LQR ISP 


2298 


16199 


A 


2314 


41 


325 


TTTLFXRVRQGYSSLLMFLLPFVCLCPA 
GRGGVYRGRQS SLS CSRLHPVRASRPliC 
LPTQASAIAGAPPPALLPPCSLISVCCA 
SNEQGSVG*DP 


2299 


16200 


A 


2315 


417 


0 


S P PRN WD YRCVP PRPANF VFLVEMGF / Y 
HVARPEFGLELPTSGEPPTLA/FSKCWD 
YR 


2300


16201 


A 


2316 


98 


288 


LMAWPATWEAEAQESLEPGGWGGGEEG 
CSELRSCHCTPAWVT\SETLSHTHTKKK 

KKNGAAL 


2301 


16202 


A 


2317 


1 


410 


LNHIPNLSLTKRKPSPHSLNLKKKKKKK 
KKKKKKKKKKKKRGGGVKKKPRGGQKKK 
GGEKKWFPSKKGGKKKKRGEFGKKNFFG 

GGKKREKTPQKKKSPKGKKKNLR/EERG 
EKNPKRGGEKKRS SSPRNIJNLRGEKK 


2302 


16203 


A 


2318 


249 


3 


PLKASS PPKAFNFCREVGP I CPPPKKKV 
P\PKIPKLVPIPPPIRKKLLPCPPPLTIi 
APPRVPLKRPP 


2303 


16204 


A 


2319 


2 


393 


AHLGLPKCWDYRHEPPRPAPIiFIiIiNSYF 
GLDLIjT/S/GDPPAI»ASQSAEITGVSHC 
AQ/PEYVY 


2304 


16205 


A 


2320 


389 


2 


RGNSNIGGPGPLRGKKFSPPPPLKNWGT 
KLGPPPPPPPFFFFF\GKPGFS IFFKPI 
LKFKGPKKPPPRAPKNGGSPGPGWFFFF 
FLSNFTQLPGIKIiSKGGKKNFASPPPPF 
FFFVHYKTGYPSATPDAW 


2305 


16206 


A 


2321 


389 


206 


FQWRWGF/NHVGQANLELtiTSNDPPASA 
SRSAGTTGMSHHAQLKNYFLMVRMWRNW 
lAVGM 


2306 


16207 


A 


2322 


371 


42 


SFPPQSGFFSPPPPHEFFFPPPPSFFSW 
VGVRQIPPPPKIFSSPFPPGGFFSPPPE 
R/VDFFFSPPPFFFPPPSFFLSPPPPFF 
FFFFFFFFFFFFFFFFFFFSFFGVKKRK 


2307 


16208


A 


2323 


82 


386 


PFLTQKYFFTPPEEGFLKKPNRREGPPS 
PITDPTLWPNMMKGIVPKAPPIIFMGGG 
INMTFSGFVTTKAPFPRPLRFNPMLRQG 
FDLLTLKASXWGSSASWY 


2308 


16209 


A 


2324 


1 


413 


RGSGDNRHGPPCRWFVFLVETGFLHVG 
RSGLELPTSPALASQSVGITGVSC/RPP 
PQASY 


2309 


16210 


A 


2325 


58 


400 


SETLVSKKKKKKKKTPPPPKNPKKKKNP 
PPPPPGAKKGPGGKF/PPPPIFQKECRGG 
FKK/SPPKKPPKKKKTPPPLGKKGPPPP 
RGGPLGGFPPPPPQKKNGPPPPKKKPPF 
FFPPK 


2310 


16211 


A 


2326 


462 


306 


AHHNLRIiPGSSDSPASASGVAGTT/GMC 
HHARPILyLSGDASADLHCFSSAQLF 


2311 


16212 


A 


2327 


1 


393 


SRPPSRTEKIRWFFFVETGSCYVAQAAL 
KLLDSSDPPTSASQSAG/ ITGMSHRAQP 
TS 


2312 


16213 


A 


2328 


391 


2 


AWFKETKAGWI IPRSWDRGSWVSQPYSA 
LTSSPESGFHSVTNGDSMTCLEDGKHSL 
VAPHGIPQR6SSLQDGVSFCPAPTMKPY 
IiASTKM\RLHEARELGFFSFLSLRQGLA 
LSPQLECSGAILAHCSLNIi 


2313 


16214 


A 


2329 


2 


115 


GCSELRSRFCTPAWATNSKTASQKKKRG 
KKKKGGVFYF 


2314 


16215 


A 


2330 


2 


406 


AAAPSALALRDGWAVRPELDLLPPCGEE 
VAPGAHCLGCGPS P CLFLS PSHTRQS PP 
APTS S PGLSTS P PDVPTHVSAPHSSKGP 
PSIPGAQALRGCGLGGWDR\PSSPSIiP/ 

PDVSPKPIiNFAP 


2315 


16216 


A 


2332 


226 


377 


KRKSKHITFLFKTLSWPDTVAHACNPST 
IiGGQGGRl/TPRSGVRDQPDQHG 


2316 


16217 


A 


2333 


3 


191 


CLSPGGGGCGGLRLCYCMPAWVTXNETV 
SQERKKERRKEREKEIKKERKKRKKERK 
KEKKKKG 


2317 


16218 


A 


2334 


295 


81 


FFKPFFPPKKTCPEFS\FFWEKRGFPPP 
PLRSFFQNPPPKGGPPILLKGPPPPPSL 
GWPPPPPLFFFFFFLR 


2318 


16219 


A 


2335 


375 


8 


TQIVPLPSNLGNKTRLRLKKKKRNEQGN 
IPTDTIDNRIKQI IQ/TYYEQIiCANK\N 
LDKMDKALESHNFPKLKQRESLNI/HSA 
KEIHFI/ILNIS TKKTPDPTGFTGKFLQ 
I PKEKKMAGHSGSHL 


2319 


16220 


A 


2336 


399 


97 


FFFFFFPKQNLA/ SVTQVGVQGQYFRSL 
QPLPPRVKPFSFPNPLSNRGYRGPPLGR 

VRQENCPKLKSKRFQLNKIPRLGKKKKL 
RFPKKKKKEKEKKIVKT 


2320 


16221 


A 


2337 


411 


57 


KKPRS FSSCSSSPP FFFFF S PPPKKKI F 
PPPQIFWGPPFPPPPPFFKPPPPPPFFS 
PQKKKKI\SPPPPKKKFFFKTPPPPFFP 
FPFFFFFFFFFFFFFFFFFFFFFFFF 


2321 


16222 


A 


2338 


97 


354 


AKAPSLSIjVLFSFSTFFLGIQGLALLPM 
LECRGAITAYCSLNLPDSSGPPTSAPSP 
TPY/RIAGATGTHHNALLLFKFFSRDGL 
PL 


2322 


16223 


A 


2339 


391 


45 


LMFFHLSHKHRSGEAPSIHWSIYLSIHP 
S IHPSIiYHSS IHLS IHST I YPSTCLS IH 
/ CI S IHPS I YPS I CPSVHLIiiVHS FIHST 
I CPSMHLP IHVS I QHFLSAQ I LP VS VFG 
EVSDI 


2323 


16224 


A 


2340 


506 


0 


RDHEQLGIVRADKKKKKKKKKKIKKKKK 
KKKSSSSSSSPGTFFRG/VPLKDPVG 


2324 


16225 


A 


2341 


203 


1 


VELRVRATEPGFNFLKGIYCTSMVDWIK 
KMWYIYTMEYHAAIKG/DRIMSFVATWM 
ELEAIFLSKLMQ 


2325 


16226 


A 


2342 


402 


40 


PYPPKKKASPTDA\FSSSSSSPPPFFPF 
SPPPKKGFFSKPFFFFSPRFFSFPPFLK 
PPPQFFFFGPLKKNFFFPPPALKFFFFK 
SPPPLFFFFFFFFFVEAGVLLCYSG 


2326 


16227 


A 


2343 


181 


1 


QLKDQASDLLGKNGDEVKETIPSFIiPSS 
LPPF\LPLFLHQFLPSFLPSFFPPSLLP 
FLS 


2327 


16228 


A 


2344 


2 


407 


FVASQLGGSGVXRVRDPLEEAVCPFSDL 
QLRAGRTTALFKAVRQGHLSLQRLLLS F 
\VFLCPAPRGGAYR6RQAFLSCGGLHPV 
GASRLLCLPKQAWAMAGAPPPASLPPCS 
LISDCCASNQRDSVGARPSEPGAGH 


2328 


16229 


A 


2345 


405 


178 


I IKFI YEKATA/ LLNG/EKLKAPPLKLG 
TRQGCLHSLLVCNTVLEVLTGTLRQEKE 
IKGIIIEKKEVELSLFADRIL 


2329 


16230 


A 


2346 


2 


412 


FKASKASLiSPLLGANTAGDFKLKPGLIY 
HSLH\ LKNYADS ILIiVLCQWWKTCAWMI A 
HIiFTAWFTEYFSPPLRPAQKKISFKMLL 
FIDSAPSQPGVLMEMYKEINWFMPANT 
TSILQPMDQGVILTLKSYWLRDTFH 


2330 


16231 


A 


2347 


2 


397 


ESLEPGRRRLQGARIMPLHSSLDNRVRL 
CLKKERKKEIGVLIRCWQECKIVQPLWE 
IVWYFLKKLN\ESPYEPAVPLLNIYPRE 
MKIHVYTNTCTQI FIVALFTIAKSGKWW 
GTVACACKPSSSE/WLRWVDYL 


2331 


16232 


A 


2348 


3 


423 


EGCSELGSHHCTPAWAT/AESVSQK3CKK 

PKKK 


2332 


16233 


A 


2349 


49 


262 


qmckgsnrrrgkrvgsrq i s kkktnap i 
kkwakdmnref/diqmankhmekcstsl 
i iremq i kstmryhly 


2333 


16234 


A 


2350 


356 


2 


FVTAPLHSSLGNRARSYCKEKKKVQVAA 
KAVLRGKF 1 1 AYTVFKKRKI SN INLS I S 
LKTLEKEEHTETKADGAQYVTKI\RAKI 
NKIETANETKSRSLEKTSKTVEGKCLRD 
ILLSAQI 


2334 


16235 


A 


2351 


360 


19 


LDAQFLEVGL.GELLFRSTVPTLQPPGCG 
ASFPVLNTIiPFXSLSPSQSSSSPASLVR 
PWVAPPFLCPHGEPDGGPDSTTSLPFPG 
PAATGPAARLIQHPASRQPRPASHTHCG 
V 


2335 


16236 


A 


2352 


360 


0 


OTFLAAFRLVFCQMTSYSIiALLSHKIiAL 
MPUSTLSDLLTRWTHCMGELFFLD ILAIQ 
NPFHTVFFIjGHPEWGMESRFVVQAVVQW 
PDLRQLQPS\PPGSMRPSCLSIiPNSWNS 
PSYGRL 


2336 


16237 


A 


2353 


34 


474 


EQWRPCKELiAARQVGCPHSCFSPHWQLL 
QKQEKTAGAVSVCVCTSTLCVCVCVCVC 
VCAQAMCVCAGA/CFCVCVCAGA/CL.CV 
CVGA/ CLCVCAEAVS VCVQ/ VAVSMCVC 
RS / VSLCVCMQGQSLCVCAGA/ CLCVCV 
CGIPPPVLCLN 


2337 


16238 


A 


2354 


297 


16 


KPFFLKSFFFFSFFFLTPPRFPFPFPKK 
KKIFFFPPRKIFFFFLIPPPPXFFFFFF 
FFFFFFFFFFFFFFFFPFFFFFFXRHGG 
WFEEITILTV 


2338 


16239 


A 


2355 


3 


315 


PVTPATRETEAGETIi\HDLGEPGGRGCG 
ELRSCHCTPAWVTEQDSVSKKKKKKRGA 
RFKE SNFTTPGLQRNI FFLGALKL I SGA 
GVLKRRDGKTLGFPQFNRPWG 


2339 


16240 


A 


2356 


399 


154 


PGQRGEIPSLPKIQ/EIiAGCG/GHLNPG 
GRIFGEPRSRHCTPAWATEQDSICKSHS 
RSGLWCYKSSSVYLSTRGVWVRGIASV 


2340 


16241 


A 


2357 


416 


2 


FFFFFSETESRSVAQAGLRTXQWRNIjGS 
LQAPPPGFTPF 


2341 


16242 


A 


2358 


209 


2 


KKNLVPWPRGGYFKSLQPPPPGV/SCPN 
PPKKLEYRVLFPQPSNPFFPFCIPSRDG 
VSPCCPGWSRTPD 


2342 


16243 


A 


2359 


279 


380 


RGYNP/WPGAVAHTCNPSTLGGRGRQIT 
RSGDQDH 


2343 


16244 


A 


2360 


415 


224 


FFFVFFFFLLI TF I LMIjNLLPCHKFLFL 

qfllgylfllylNcapfllvalnifits 
fqqldyytf 


2344 


16245 


A 


2361 


278 


2 


ISDPFRFWNIYRIHTGLISLIQISENCV 
S CQKFQI LEHFRFQ IRDAQS VLRK\RKA 
WTGAVAHACNPSPIiGGRGGWISRSGDLV 
HPGQHSETP 


2345 


16246 


A 


2362 


139 


282 


KKKKGGRGGGGGRGGGGXXWGGTKKKKK 
GGEKKNLWGGGKGGGKGGGS 


2346 


16247 


A 


2363 


19 


409 


PKPPSVLGGGGPARYPSPLGGPNRPVPL 
GPGVGAPPGPPGKTPFFLKIKKIYPARG 
GPPVIPASPGGEGKKSPLPPRPRVPLTQ 
IFPP/PPPPGGPNQG 


2347 


16248 


A 


2364 


159


383 


HSHFKNLSSIIKKIiHRNKTFTEHFSLSS 
SLNQCFLiNLTVFYSHLGNFKNSN/SWPG 
AVAHTCNPNTLGGRGGQITR 


2348 


16249 


A 


2365 


400 


221 


GRLRQENCIiNSGGRECSEPRS/HriHCAP 
AWAT\EQDSVSKNKKQNKQKQTHIYTVL 
LCARH 


2349 


16250 


A 


2366 


383 


14 


GGRGCNAPRSCHCTPAWVTERASPQKTK 
KQKNTHTKKRISSCCYKMEDPLRQA\TF 
LQCPRAEGPSQKAARMELMEKQEKNQGP 
ARHRRQEQPLTSRPCPDHLCWLSQVSS 
TPAQGLSLIiICK 


2350 


16251 


A 


2367 


274 


1 


PRKILKARGKEHLASRGTMIRMTSDFYL 
QTMQARREWSKIL/WVLEEKIHQHRII)/ 
PVKSSFKSEEEIKTFSDKQKLRGLVTSR 
SDLGKDVK 


2351 


16252 


A 


2368 


161 


2 


PFFFFSETESCCXVTQARVQWRHLGSLP 
GSSDSPASASQVAWITGTRHYAWLIF 


2352 


16253 


A 


2369 


361 


198 


ISTGRLIF VPLVEMGF \TMIiARLAS CDPPA 
SASQSTGIRGMSHNSQLKCFTEFDSFC 


2353 


16254 


A 


2370 


116 


300 


HLNGDAVEERDFMKCTVSGI I VAHCNLE 
FLG/SSDPSASAPRVAGTTGMCHHrWLI 
FVILVEM 


2354 


16255 


A 


2371 


2 


192 


MKUDPHIjS PYTKVNSRWIKDLNLRPKT I 
KILEPNIR/ ITLLGIGLGKDSMTRNPKA 
lAIKTKIiAR 


2355 


16256 


A 


2372 


1 


133 


AGELLEPG\GRGCSKPRSCHCTTAWATE 
QDSSPEKKKKKKKGGF 


2356 


16257 


A 


2373 


43 


403 


LHDSPAIiAS QGAGTTG VSHHARPAAGIN 
SRIGQAEDRISELEDWLSEIR\RQSGRN 
MDKRMKMNKQNLQEIRDYVETKSM/NTR 
LIGVPERVGENGSNLENIFQDLIHENYK 
GKPIRLMVDL 


2357 


16258 


A 


2374 


404 


215 


GQGGRITRAQKFETSLGNWRPGSEAQE 
LiIiE\PDDRGCSEPRSCHSTPAWTTEQDS 
VSKKRRKC 


2358 


16259 


A 


2375 


3 


397 


SKQLEFTQLYTKLNQLNKTKISDLKKEK 



YPIRLFSKENTQMASRYRTKYSTWLI IR 
EMQ I KTTFRHKLAALASSRCLLGLG\AT 
SAHCCTVGAPLW 


2359 


16260 


A 


2376 


413 


1 


PKKGLFPKPIIWVTPGFPPPPRFKKPPP 
KKI FGAPKKKKKS PPPPAKKFPFFKGAP 
PPFFFFFFPFFFFFFFSGDSQERVREAM 
PVAGGP\PRPHSLSAPHAPGGTAWTPMH 
PVQTHKAQSPKLPASECPPPTTPLS 


2360 


16261 


A 


2377 


398 


247 


RRFHHAGQAGLELLTSSDLPALASQXSA 
GITGMSHHAQPSATHFQKHIiVS 


2361 


16262 


A 


2378 


2 


143 


QEl«iLNPGGGGCSELRSCHCTS \AWVTT 
AKLCIiKKKKKKKPFGKGGG 


2362 


16263 


A 


2379 


417 


90 


FFFFPFFPGGGRLQVCPPPGILLFCFLY 
KGGSPPGGSTIFPPPPPGEVGPPGPPPP 
GGFFFFFLE/QGGVSPDGPGFFVLPTPK 

KPPPPAPQKGGEPKFKPQVWGPPWPTF 


2363 


16264 


A 


2380 


36 


427 


VHPLNHHDQKGQASSTQKKKKKKKKKKK 
KKKKKKKKG/WKGGGAF 


2364 


16265 


A 


2381 


402 


2 


NFFIiKGRGWGPLPPFLPRGPPPRGGPQK 
/RGGLGGPPPFFFGSKTPPAGGSRTPMG 
QKKKGPPLPEGPLFGGAGQAPFLPPPVP 
RGGVPSPKKKKG/APPPFFPPPFFPPPP 
PPFFFFFFLRYNLALLPRLGCSGT 


2365 


16266 


A 


2382 


166 


5 


THGHVIYVDQEMQMEyilENM/WPGTVAHA 
CNRSTLGSRGGWITRSGDLMVKPRLY 


2366 


16267 


A 


2383 


2 


457 


TSQPSLLSSWDYRSTSPRLiANFMLFYF 
lYFFFAFSVETGFHRVSQDGLNLiLTS/S 
/ I P S I PRI PKHWD YRHDP \RTWP 


2367 


16268 


A 


2384 


2 


417 


GRVGFSQSNGNPSSLSFTLLKVDFEVTI 
PGEGKDRIFKVSIKWLAIVSWRMLHEAL 
VSGQ I PVPLES V/ QALDVAMRHLASMRY 


2368 


16269 


A 


2385 


318 


32 


TMErMLDKKQIQVIF/FEFKMGREAVET 
THNINYTSGPETVQWWFKKCCKGDESLE 
DEE CSGRPE VGNDQIjRAI I AHASADAWV 
DRDSGRCCCACP 


2369 


16270 


A 


2386 


409 


3 


ISQAPSTPPRKRGFR/PPQKTSFLGPPF 
YAAFYQEKKFLFPSSDPPREl'GDKGKQK 
GFPPPKVAPKKKGPPKKGPPPGKKKDPP 

SFFKHKSQPTRPPRAPALEGPRSR/SAA 
LQPGDRRR/PPSQKKPTRPPTRPPTRP 


2370 


16271 


A 


2387 


415 


10 


KKGPFPFGPFIFLFPPPGFLPPPPPPIF 
WFSGFCPPPPPLFYFPPRGGPK/HXLVF 
PLFFLPPPPFFFFPIFAKESPPKKGPGP 
LNFPPFGPKKN/S PPPPFFFFFCEMEPH 
SCRPGWSAKWHDLGSLQPPPPVFK 


2371 


16272 


A 


2388 


410 


95 


katmdksdhiklksfypaketptkvktq 
ppewekvfanypsdkglipi lykelkqp 
ygNkksnnsikkwakdlnryfskdiqma 
nrcmkrcsrpgavahtynpst 


2372 


16273


A 


2389 


362 


92 


rflfffspppkkgffpkpffffsprvfp 
ppfplnpppklif\gppkkkiffpapgg 
kkifflkgpppfffffffffffffffff 

PVENTFY 


2373 


16274 


A 


2390 


131 


487 


ATEHEKTEKSSLSFFSISKRKKKMEKLH 
DIGFSSNFLG\RPKAQATKGKTDWTSAK 
LKICSSRDTISRMKRQPKEWAKTFANKS 
CNNKKPEKIDKNKKKKKKKFLGGALLKK 
TNLKPRG 


2374 


16275 


A 


2391 


408 


145 


RWSIiALSPRLECSGVSSAPCKVPPPGVT 
PFSCLSLPSNWEDRCESPPPAQ/IVFIG 
EGFYILHGFF/RRGPKIRCFISGCPPPV 
LSFPT 


2375 


16276 


A 


2392 


1 


203 


LFFFAFSVETGFLHVGQAGLEPPTSGDP 
PVSA/ PPMCWDYRHD 


2376 


16277 


A 


2393 


386 


16 


TPSAGGRHIEXiSLSTCPSCAQHQGKEHL 
EGGEGGGAQSLTTAPSSATSSQDPISAH 
AVEDKLiSIRLETDILKTKS/WPGAVGHT 
CaSTPSTLGRRRQANHLRPGVRDQPGLVI C 
RPRPPKVMGLQA 


2377 


16278 


A 


2394 


3 


396 


QLLERLKQEDRLSTGSQGCSELRSHHCT 
PAWATEP\DSVSK 


2378 


16279 


A 


2395 


273 


416 


FFYS FL I KIRWKKQ PGMVAQACNPSTLG 
GQGGRI /TLRTGVRDQPGQHG 


2379 


16280 


A 


2396 


329 


76 


FIPIESACSQSCLKPNLRQEW/YIFGTL 
KLIFFETESHPVTQ/DWSAGEIiL\DPRG 
RGCSEIjRSCHCTPAWATRAKIiCLKHTHK 
IKK 


2380 


16281 


A 


2397 


210 


7 


GGKKMYCRKPGGGGFFAAVWSPKGLFS / 
RIWKEAPILSPQKKKKKTNNPIKKWAND 
ANRHFSKEGTQQ 


2381 


16282 


A 


2398 


48 


393 


S ILKTRKPPLKKGRGRRKKEKESVRTHV 
PFSYQSNAFPSKPLlSnSISMTLGRLLSFSF 
FLVHFFFYTDGILIiCGPGWSRTPGLKGS 
SHLSFPKCWDYIHEPPHPAYHS/LFCR 


2382 


16283 


A 


2399 


128 


383 


EEAPKHFPKPNLH/ QKKVIiVTAWEVAAAG 
LIHCSFLNPWETITPEKYAQQTNEMHQK 
LQCLQLALVNRKGPILLHHlSrVRLHITTH 
AS 


2383 


16284 


A 


2400 


54 


384 


LFTFILNSVFHTYMCIiYFWTIiFFSVNPF 
VSMP I PQCLDDS S F 1 ISLDSE INPNI YS 
QLI FDKKTQRGKNSLPDRWCWENCIFTY 
KRMKWDPYL/SPYTKIISNWMKDLNIK 


2384 


16285 


A 


2401 


393 


1 


HRGENTHQQGGGLSRWRRHSRQRGTSRW 
IRHTRQWGPSRWITPSRQRGSSRWRKRS 
RAGD lEVEKTQQSEGALEVRIRSRQRGT 
SRWRS PSRQRVS FRWRI CSRQRCSS / KV 
RRRSRQRGTSRWRRHSRQR 


2385 


16286 


A 


2402 


1 


330 


RPPPPPHCWDYR/HEPPR\QPTIiWVIFK 
LSVETRLCYVAQIGLELLGSSKESSRLD 
LPKCWDYRHELLCWMVIFQBKLV\SGFL 
FKIPRFFKAGMEI FKQIEGPWfl SHPLAT 


2386 


16287 


A 


2403 


3 


407 


ADAWGLRGTHGPWEQAGISGISPSNSFL 
PVCFRQSVALVSQAGEQWRRLGSLQPPP 
PGCGRESC\PSFQGGITPSCFFVFLEKM 
GLHRV6QAGLQLLMSGEPPALCSQRRGI 
TGVSHHARPPSKGFIWHTGAPAM 


2387 


16288 


A 


2404 


239 


573 


VCFGVFSIiSHFFESEFSS I TQAGVQWLN 
tiGSLQPPSRGFKQFLCLSLPSS\YWFTG 

TPPLVGGGSAGLRGG 


2388 


16289 


A 


2405 


1 


122 


PTRPGRGCGELRSCHSTLAW/VNSETTS 
QKKQRKEKKLPLL 


2389 


16290 


A 


2406 


417 


130 


QALRVKHVQLVP S S DFLAKTRAQS VNNL 
tiRHSSSSSSSSSSS/SSSSSSSSSSSSS 

sssssssssssssssss\ssssssssss 

SSSSSSSSSSSSSSSSSSSFLFLFFFFG 
LRGAFLGRFCP IRR 


2390 


16291 


A 


2407 


327 


3 


RQSQSLRYMAHQPVCCFTCSILIRAELN 
SI/ETKNKKIEKINET/RKSWFFEKINK 
MGAITPDATERQNI IQGYYEQLYMQNLE 
NLEYMDKFLGRFKP PSLNQEELDTIiNR 


2391 


16292 


A 


2408 


178 


404 


AAYRINSLQEVQHPTKRHSQPRKQSKLR 
EGNAQRWNIRRGPGMVAHACNPSTLiGGQ 
GGQV/TLRSGVRDQPGQHG 


2392 


16293 


A 


2409 


94 


386 


TSFALVALGGRSCSEPRSCHCFPAWVS \ 
S ETLSQWKRKTPTLNNAKYWGGRFLRFF 
FLTRGGGGRFFFFGGTKTKSLGAGFKKG 
GGGKPGGPPNKGLG 


2393 


16294 


A 


2410 


1 


384 


PTRQVRIKLFPRIYVHTRKRLKNIYGNT 
ETFITQPFKNHSWQLIRNNRNAKKQEN 
VAHNEEKNKKQS IETN\QKYTHNKNLDT 
TKKKKKKKKKKGGGPIiKKT I FKARGGEN 
NFFFLGPQKLNSGAGF 


2394 


16295 


A 


2411 


423 


2 


FFFFPPPQKKSLFPPFFFCGPKIFSSPP 
VFLTPPQKIFFCPPKKKKYFPPPRGKIF 
FFLK\PPPPFFFFFFFFFFFFFFFFFFF 
FIFFFFFFIiGFLLSSIiLIGVPFLiLLNAN 
IMILLQKKILKDEVQI ISSFPSTKNRNV 
L 


2395 


16296 


A 


2412 


218 


112 


LKKEISnSTTKC Y YG WGTTGTL IHC \ WW \ KL 
IQVLWKTD 


2396 


16297 


A 


2413 


64 


403 


LKNFFFFFFFKKKGPPGAPGGSKPRGLG 
EPPPPPPQRGGNTGGGPGARPKKNGGGF 
FFFWRGEPP/PPPPAKGGGQDLGPGPFG 
REETNNFPAPAPPGPKNKNLGPRAQKIL 
CF 


2397 


16298 


A 


2414 


217 


409 


VKYLSYSVLLTIIGTVQACYQELRPGAM 
AHTCNPSTLGGHGGRI/TLRSGVRDQPD 
QHG 


2398 


16299 


A 


2415 


416 


0 


ERPPPSSFFPPLAPGQVGQFFYGKGPWP 
EKSKKKNRFWGTKPFPPEGGGPPPPPNS 
PPGGGEKSPLFFFGGKGNPTWSPGPKFL 
KKRVIKGPLHPGGKGYKKTREFGLKPGP 
PPPKPFLGGG/PGFQNWEKPPQP 


2399 


16300 


A 


2416 


277 


0 


PFVLLSSSSSSSSSSSSSSSSSSSSSCS 

LLVRYLCKVKHKSSLCAHPGTVPFLELI 

HDTDSFAH/VFPADLL 


2400 


16301 


A 


2417 


205 


486 


RHGYGFGFWKNTCMQHRHSYRRVYIHRH 
IHLTFLAYVHRESPEAMSALLTQILHiFF 
KLLSFLRRCLA/SVTQAWQWCSLGSLR 
PRPPGFKRPWC 


2401 


16302 


A 


2418 


364 


1 


KESLGDPVKDTVILFATRNPMQMGSNYQ 
FFIYLCLTHLLSRSYCWFGCKMLQ/PIiW 
KTVWQFLKKUSTAEFPYDPAIPLXDICIP 
VFTASIiFI lAQRWEQPKCASTDEWINQM 
WHMHTIEHYPR 


2402 


16303 


A 


2419 


281 


492 


LFPPSLPASHPKLSTQQPERSHSVTHCG 
/VDRRGSDLGLLQPRPPRLKPSSRLSLW 
S S WDYRPAP PRPANS C 


2403 


16304 


A 


2420 


63 


465 


RLQRFLLSF \VCLCPAP I GGAYRGRQAS 
LS CGGLRPVRASLLLCLPKQAWAMAGAP 
PPASLPPCSIilSDCCASNQRDSVGVGPS 
NPGAGYYLVARRFLSPLEKRSIQVGVTL 
FSRCRLSPLSLTRKGNSLTPCAS 


2404 


16305 


A 


2421 


61 


482 


QRARI TGVSHHAQ/ LRFCLFDMRSHS VA 
QAGVQWCDNSSLQPPPSGLKML/STSAF 
QSL/WDYRCEPQHPIHY 


2405 


16306 


A 


2422 


90 


444 


YCFSSECSEKCRCPGHDLQLSP6LCHHG 
VSGLAKPLLCFWPWIWKPQPLCSPTTSL 
PFLPLPPPSQCPQGPAWLELEGRPSLWK 
QGGLQSLAIKR/REGSRAQWLTPVIPAL 
WEAKAGG 


2406 


16307 


A 


2423 


1 


444 


PGPGPGCGPVATLGSPSRPARTDSPSLP 
PHSQLREAEARNRDIEAHVRQLQERMEIi 
LQAEGATGESLMCPLPRRT/WEEVGRLL 


2407 


16308 


A 


2424 


189 


418 


SRRAE PGSFRGCLRVGVPS CTGV\SLWV 
CVCVCWG/ VCVCVCVCVCARA/ CLCLCA 
NFSL/CSHVSLCLSLSLSL 


2408 


16309 


A 


2425 


30 


895 


IiDEQCTSEIHRRGEATARPRAPEHPAPP 
ATAVRGRDAASQNLIORRPGSGTDGLRLQ 
GAEPSRLLRTYAGGAVIPRGTPERAQPP 
PPQDPLGRRRWLSRNTWGPWPGTTQPPS 

PQLLRNDWGS CGFMVPEAARGKVFQDS Q 
EGAH I RRETVS KS VCAE P WRHQRARD PA 
PTNFPLKCQKQRGASTSSGQHGDRVNLV 
FFIDDDYSPPSKR/PKTNEPPQPPVPEP 
ANAGERKMREFNSGPHNPVEETKLICLC 
PSGHASCQVHLWTGAMLLGFQSWRKLPG 
SGLKARILQ 


2409 


16310 


A 


2426 


494 


154 


SSRVRCQCA\LLGGASQLGCSGVRDPIiE 
EAVCPFSDLQPRAGRTTALFKAQMEMQK 
S PVFC VAHAGS CRLELFL FGHLGS SLHG 
LWN 


2410 


16311 


A 


2427 


416 


1 


PQRGPLIiALEPGRQGAAPVEDLQPQGPD 
KPPPPPLPQPFRARTWTAAVPRHPPPV 
ACHPPQPLAASKPWP/SVAGGDLLPLPG 
PERPVHAFFIGFI FVHLGLGGVSGRGAV 
APARSGPVPPRPPSSSTSRFSLFFLHE 


2411 


16312 


A 


2428 


84 


409 


DYKHAPTMPRIQNFVYHSPQQPRCENRI 
NFTPKKKKKKKKKKNKKKKKKKKK\KI F 
WGG 


2412 


16313 


A 


2429 


1 


389 


LRDLSSDRSNPGRFLSTSNSSLY/EKDK 
RNKAYFTK/RPSPVNDIIST 


2413 


16314 


A 


2430 


456 


0 


PGWPGRGAP\PSQTGWPGRGAPHIPDDG 
QPGRGAP \ PSQTGWPGRGAP\PSQTGWP 
GRGAPHLPDNGWPGRGAPYIPDDGQPGR 
GTP\PSQTGRPGRGAPHIPDDGRPGRD/ 

GSSLPRRGGSR?IEALLTSQTGWPGRGAP 
P 


2414 


16315 


A 


2431 


3 


344 


CRERRS CHCTPAWAT \SETPSQKKKKKI 
FFGNGPPGGPQAGLKIiRAWGFFQKRGTM 
GPGTKNHPGQRGEPPLLQKKQKITRPGG 
GGPGAPPPQGGGAGKSFNPGGGMFQGGE 
IP 


2415 


16316 


A 


2432 


1 


109 


RPLRRIiRQENRLNEGSRGCSEPKLC/HIi 
CTPAWAT 


2416 


16317 


A 


2433 


1 


239 


QSFAVLPRLVSlSrSWAKVTCL/ PS VSQNA 

EIT 


2417 


16318 


A 


2434 


3 


464 


DWLQLEMQGEIVALVHSHHGGLPWLSES 
ARRLQVQSDLPWWLVCRGT IHKFRCVPH 
LTGRRFEHGVTDCYTLFRDAYHIiAGIEM 
PDFHREKDWWRNGQNIiYLDILQAPGLYP 
VPLSAAQPGDVIMCCFGSSVPNHAAI / Y 
CGDGELLPHIPK 


2418 


16319 


A 


2435 


3 


427 


EGARTSSERHPCNKYLLLHSLAHLFINA 
LNLVLKGIiSPSPFPALPISFPIFFFSPH 
FLGTPTLEGGRADLPFLQPPGAPG/QPA 
PLNYGPGPYRNPCP/RLPQLKPAGPGHG 
LLKSPPPNPGRNWPLLGSLFDFFKKRTS 
IPLP 


2419 


16320 


A 


2436 


238 


400 


QFRWKRGKAIFFFFFLRQSPSVTHAGMQ 
WSNLSSVQPPPPGFKQFLC\PSVPSS 


2420 


16321 


A 


2437 


11 


442 


LGTRPRATDWGVRLELSRACPVLGHPAK 
HPRPQRWCKVNFSYSPEQADELKLQAGE 
IVEMIKEA/CGNPDMPSVSPGPQRPPKT 
TEDK6WWEGECQGRRGVFPDNFVLPPPP 
IKDAQPLLLFGIDQEAGPTESGISGIRF 
RRLSC 


2421 


16322 


A 


2438 


411 


80 


PQAEE6APTPGSENFNPPPPRGGGAPPP 
PQKNFFPPRGVNPGGGGGQKRPPPKKGG 
/SPKKNPGGDKNPPPKKKKNIGEGGGFI 
GAPRGTPKKTPPPRDGYFQVFFFIVSLK 


2422 


16323 


A 


2439 


290 


1 


QLNKIKKTPLLFLPWANEKVPKIPPNYP 
PGPKKKGYPPSIFFFFFLRQGIj/NSVTR 
AGVQWRDIiSSLQPPPPGFKRFSCQKPIffS 
AFPHASADAWVDP 


2423 


16324 


A 


2440 


440 


213 


PFSRPLFFFSPPPKKRASPPPFFFCFPR 
VFPPPPFF/SKPPPKFFFFPPPKKKKIS 
PPPPKKIFFFFSPPP 


2424 


16325 


A 


2441 


131 


408 


GCVPPEPAFIiCFVIiEIVSSVAQAGVQWR 
NLSSLQPSPPGLKRFYRPRLPSS/QDYR 

RAPPIiA 


2425 


16326 


A 


2442 


102 


351 


QAS S S VLKLCVCVRARLCV/ CACVCACV 
CfVC/VCVCECVCVC 


2426 


16327 


A 


2443 


190 


3 


PQGAREKSHRPGPIGRRLKIi/DPPFLSP 
HVKINPRWIKDLNVKPTTIKTLEGNLGN 
TLLDTCPG 


2427 


16328 


A 


2444 


410 


30 


VCVAPPLCVCIGTIiCVRTHTQFCVCVHT 
HSLCVCR/CQFLCVCVQ/VAVSVCVCRA 
SLCVCR\ALCVCVCR\TVCVCVHGTVCV 
CVCRDSFCMCVCAGAVCVCVCVCVCVXC 
liCVCVASAQLLGRGFCSSVNIKGAGP 


2428 


16329 


A 


2445 


243 


1 


KVMVQNKGPFSNLFFFGPTINFFTPQFK 

QGGGQNPNPFLFFFFFMRQGIi/DSVTQA 
GMQWCHAGSLQHLPPGLKQFSCLLP 


2429 


16330 


A 


2446 


276 


408 


MLKNCAFWPGTVAHACNPSTL.GGQGGRI 
/TLRSGVRDQPDQHG 


2430 


16331 


A 


2447 


24 


405 


LGDVCAFFFFFYLKNQLNPGGKIiILPVG 
PAGENQRWEQFDKLQKGGIKMNPLRGGI 
SVPLTKKKNQ/WAR6E 


2431 


16332 


A 


2448 


239 


3 


SPLCGNNVYKPSTVEKTNQVEKMPPSKQ 
IWELCVELIiWQ/SNRGIAGSGAHACNPG 
TLGGQGWRIMRSGDRDHPGQHGBTP 


2432 


16333 


A 


2449 


406 


144 


GCSEPRSRPCTPAWVTSETLS/RKKRKR 
RKKKS CLLRAILSTSPELTHFTLTTPLF 
SRYNDYPHFRDEKTEARRVYATCSGSHS 
WKRLG 


2433 


16334 


A 


2450 


349 


1 


GGAIFLTPRPKVPLTPITPtiAFNPGHQE 
EIPLPKKKKKI VRAI YDKPTANI XLHGQ 
KVEVFPLKTSTRQGCPLSSIiliYQHKTRM 
PSIi\PiiIiFNIVLDVIiARAIKQEKSIKGI 
QIR 


2434 


16335 


A 


2451 


382 


19 


DLGRMTAGSGDQRCAVGVKLLS PSLGAA 
GKGPKHSBAEASLPRNPGSYPTRQAQWPQ 
SSUILKGTIPDTTSLNTPWKMLSSLKVP 
SWQGAVAHDCNHSTLGG\RGGWITRSRV 
RDQPGQHG 


2435 


16336 


A 


2452 


3 


396 


FKLiCLMIjIYHSENPRALKNyTKSTLSML 
YKW15NKAWMMAHVFTTWFTEYFKPIAET 
yFWGKKKITLKIIiPLVDNAPGHPRALME 
M YKE INV/ VQ I C ILQPMDQGVI STLRS Y 
R\KNTPCKDIAAIDGDSSDRT 


2436 


16337 


A 


2453 


115 


411 


KGGNFPPPGG6GEKKIW/ PNGGPPPRGK 
RKPPPPPPGGGGKGGHHPPPGPIFFFEK 
KKKGLFGG/PGGAPNSHPKRNPPPWFPE 
GGELTNPPTFFFEGGAF 


2437 


16338 


A 


2454 


3 


114 


HHV/GQAGLELLTSDDPPASVFQSAGIT 
GVSHRIRSVS 


2438 


16339 


A 


2455 


397 


1 


WSWQK\NRRIDQRSNIESPEMNPRMYGQ 
VIFDKIAKNTEWEKDSLFNKWCWKNWI/ 

lkrmkldhphltpXkqkskwmkdlirpe 
teklpeetggnphdiglsndfXldltpk 
aqgakvntdkwdni klknfptrp 


2439 


16340 


A 


2456 


424 


3 


PKKKKIFFPPPGFKIFFFLRAPPFFFFL 
CLSHFLLNRSRSLSCTTSCCVSTIPTSL 
CNKSSGVXCGLHCSLIAI/CSLIHLTLC 
PFCVLLVQMCDTVCVCVCVCVCDTVCVC 
PCPYGTLDIAFKHFFSRWSIiTLVAQAGV 
QR 


2440 


16341 


A 


2457 


266 


379 


HWPGAMAHTCNPSTLRGQSGRI/TLRSG 
VRDQPDQHG 


2441 


16342 


A 


2458 


10 


409 


SRTGPNPRAQTDRPWCVAFACFELPLW 
RSVDSATREAEAGGLL\DPGGRGCSELQ 
LCHCTPAWV/TSETIi 


2442 


16343 


A 


2459 


184 


387 


IVHFQMHKMINVAYIIPQFYSFILEIQS 
HSVTQAAVQWHSHS SLLPLPPGLKQSSH 
/LSLPSSWDY 


2443 


16344 


A 


2460 


110 


1 


KNRVS FFF/GS FETES RS VTQAGVQWCN 
LGSLQPPTP 


2444 


16345 


A 


2461 


380 


2 


FFFFFSETESRSVAQAGLRT\QWCNLSS 
LQAPLPG 


2445 


16346 


A 


2462 


382 


29 


NGPGHGGPPVI PGTWGGQGGGFPRF \GS 
KPGFTW6NPPFLKNHKNYPGGGPPPVIP 
NFLGGKPGNFFYPGGGGFQ/SGPGAVFP 
PPPGQRRSIFLPKKKKERKLIIiPPSLEQ 
RLQNCDA 


2446 


16347 


A 


2463 


228 


1 


KKGTLFKPPPPGGRFFFFFFFETQSRSA 
VTQATVQ/WMPSRLTANPTFRTQGILLP 
PGLKGS S CLSLPSS WDYRHV 


2447 


16348 


A 


2464 


395 


1 


RLRRENHLNPGGGGCSEPRS /HPAWQQS 
ETTSQKTKTKTKQKALASDTVIiSPRQSS 
EERFHLSLFHSSFVTPFGIFSFLTTSPH 

PILPVWLAPQLPAIYCYGDSVIKSWFQL 
WLSKELQESLSRLLWTQAFS 


2448 


16349 


A 


2465 


29 


299 


ETPNEASPKTSWDYRHVLPCLANFFXFF 
VKTGL/HRAGLELLTSSDLLCFPKCWDY 
RH\DRSTWPLSSVFGSIFLVYYWPLYLI 
ETNMLTTL 


2449 


16350 


A 


2466 


1266 


1473 


YFVPQSTQNHAAVFRVGSLLQEGCGKIS 
KliYGDLKHLKT \ FDRGMVWNTDLVETLE 














LQNLMLCALQTVNG 


2450 


16351 


A 


2467 


49 


356 


VQVILLPQLiLRRLRQENCLNPGGGGCSE 
PRWSHC CP / PAWVTEQDS I SKTKK 


2451 


16352 


A 


2468 


115 


3 


LSHTKWSAWPGAVA/ STLGGRGRQITRS 
GVQDQPDQHGE 


2452 


16353 


A 


2469 


3 


404 


FVAPGGGQGTFPGSLQPWPPRGPGASGS 
PSRAPLIPGFFGGKGVSPVGPGGFLLPG 
SRD/SGPPGPPKGWGSG/GLGPAPGP 


2453 


16354 


A 


2470 


404 


218 


FAQWIKWGALSSPKPPFPGFKLFSRPS 
PPSYWDYR/RRPRPPRLFFFFFIiRQGIiA 
LLPRLE 


2454 


16355 


A 


2471 


386 


258 


AGGLLS PGVQGYNKI.LSCPCKPAWTT \ S 
ETLSQKKKKKKLPFY 


2455 


16356 


A 


2472 


2 


364 


HHTQ 1 1 FVFLVETGFHHVGQAGVEIjIjTS 

gdppaswdygrXghrtwpyshifnnl 


2456 


16357 


A 


2473 


1 


292 


drgcseptschctpvwat\setvsqkkk 
kickkktpllrgpekkngnppprgffrkn 
pgfkrkkggapppglfkigrkkspvggh 

rhtfilggplprf 


2457 


16358 


A 


2474 


1 


307 


SDSCSSIVRRACFPFTFYHDCKFPEA/S 
PVMLPVKPVEL 


2458 


16359 


A 


2475 


2 


596 


mknaediltmevlkstmkqeleaa\qkk 
hslwellripnickricflsfvssssss 
ssssssssssssssssssstkitaw/lp 
pleasfyrstclmparallfastipfwg 
ltlhlqhlgnnvfialqtlfgavtliianc 
vapwalnhmsrrlsqmllmfliiatclla 
rifvpqgekssqveerkclslfsqglpw 

SHLS 


2459 


16360 


A 


2476 


367 


44 


YQEDITIMN/TYALNIGVPTYIiANI/DL 
NRElESNrilVEYFNTSLSKMDRYRSSR 
QNIDKETVDLiKYS I /HINQMDLTDRYRT 
FHPTATERDSISKKRIKIKIKINKSLT 


2460 


16361 


A 


2477 


12 


362 


HHEPG\GGGCSELRLCHCTPAWVTELDS 
VSKKKKKKKSQKKK 


2461 


16362 


A 


2478 


401 


1 


PFFFFSETRSHPVA\RLECSGAISAHCN 
QCI 


2462 


16363 


A 


2479 


3 


353 


YMCVCVCACVCICVIjPWMCLCVC/ cycv 
CVCWYLCVCIiCVSLCGHQHIiAVSGKRSQ 
P PSHES FKTSLLIiWVFKWLPPLLCAWVP 
QVSQNMPLDGTRTLNNVSLPDEKVHDLL 
LPLTAA 


2463 


16364 


A 


2480 


57 


361 


MNESLIiSCKRWSLLAFSHI / CVFCLICS 
TDISAIiCVTVCACMHVCGCRCVCMCLCV 
CLCLCVCRCMYGRVCAPMCLCVCVHCVL 
ARMCACGCGCVYKAECPI 


2464 


16365 


A 


2481 


3 


381 


YMCACGYICMHA/CVPVCMCVCTCLCAH 
VCL/CCMCVCTFVPVCMCVCICVCLCVC 
TPIITLPFLSQERISFCTDLRSFRATAK 
RSHEEVKNCVYLQTI WNQYLI I FYIiQNP 
NFGWAQWLMLVIPAVWE 


2465 


16366 


A 


2482 


2 


380 


IHVLGNCLYFSEPQFPHWQQRITRSFLE 
GYGENLMRLHMS YIHMCLHI CAYI CI / C 
/LCVYI CVLLCAYI CVCVYLCVCI CMC I 
FVYLCTHICVFVCVLVCIYLCV/C/VCV 
CICVCI 


2466 


16367 


A 


2483 


296 


1 


NGTTIRRMIiPIFSRYRPGMPGLVQCTIQ 
SCTADTSVPYSHGFIYFFIYLLRDRVWLC 
YSGWSAVAPSWLTITSN\YRLKPSSCLD 
LLGSWNYRQMPPCr 


2467 


16368 


A 


2484 


1 


319 


NTVGLCVCV/ CLFVCVIiSLCV / CLSLCV 
/CLCVCWSLCVC/VSLCM/CVSHCVCL 
CVYLWL.CVSVCVSLCVSLCVCVYECVIS 
STKRSLOAPSRAEAAKLPRNWAPAKQSH 


2468 


16369 


A 


2485 


3 


334 


nhfiisidi/dkhfrkihqpfvpkaitk 
lgikgnflnllrgtsvkstgninmXnge 
kpnnlplrlviqgdvlfhlllevliasav 
k2ckkkkkkkkkkkkkkkkkkkkkkkkkk 

KK 


2469 


16370 


A 


2486 


17 


409 


CLDNKKPRKEYCQKTHLIKKNPLLSRIC 
KELLKLYDKRMINPIKKWAEALYRCLSK 
GDV/HEMASKHMKRCS IS/li/S IRKMQM 
KMTL 


2470 


16371 


A 


2487 


410 


165 


LECNVTIMAHCNLRNIjGSSDPPTSAS \Q 
TAGIHHSLQHFFFIiFFVETGFCYTQAGIa 
ELGSSDLPTIASQSVGVTGTGHGTWP 


2471 


16372 


A 


2488 


142 


417 


VEQLLEGYRTKSLYLRS fflfvcmflrw 
SL/DSVSPGLTNCHHIiGSIiYPPPPGFKQ 
FSCIS\LWAWRRTP 


2472 


16373 


A 


2489 


1 


404 


KAGAQMGIKHPQHS SRLRLRRPSPRLPS 
SQKPLNTHYFPS FS FCAHRHTHTHTHTH 
THTHTHTGV/HLHPHT 


2473 


16374 


A 


2490 


373 


146 


YGLERKISFKIL/LFVGKSPSYPRALME 
MCKINWFMHANTTPTLQPIDQEVILGP 
GTVAHACNPGNLRGQSGRNA 


2474 


16375 


A 


2491 


427 


85 


KTPPPP\RGPFFFYFFFFFKKAKNFFPP 
PPTPPLGKKKNP\PPPEKKISPPPGFPP 
PPFKKGPPKTLFKTPIiKKKKGPGAPQKK 
PFFFKTPPPFFPFFF 


2475 


16376 


A 


2492 


443 


240 


FLFFFFFXXFXXFFFFFXFFXFFFFXFF 
FFFFXXXXXXXXXXXXXXXXFFFFFFFF 

FFXGCLFIYLFI 


2476 


16377 


A 


2493 


3 


135 


I SAHCNR \ LLLi PGS SNSTASAS PVAE I T 
GACNNNRQDFS F 1 1 LQ I 


2477 


16378 


A 


2494 


37 


420 


AHVCVCTCVCVGICIiSRRVWCAHSHVC 
AGIFLHLINQTFTTHL/CVCKVMALRVT 
P S P CLQGA\QTQLAGQCMCAKVCMNFM Y 
IFIBGHICPQTCSCLGEPIRGGLiSPLVC 
IC/MCAGFFVSFC 


2478 


16379 


A 


2495 


402 


178 


RQNKTPFGVKKQITGGGGRGRVSYFLRG 
VSQENPLNPGGGGCSEP\RPPGWGTKNP 
LPQKKKKSSKQQEGRGED 


2479 


16380 


A 


2496 


138 


439 


RTFILGDHHHEPSAVHFHLPRRKDCTHE 
TGAAPPPPALGATVPPSVSVCLTGGGPQ 
ITPPPPALGARDLPSVSVRLTMGGPQIT 
PPHPDQQPL/SPPSVSVC 


2480 


16381 


A 


2497 


118 


382 


VKSFCASMRACVCECVYMCVSMCVCVCV 
CVSACLI CVCLCQ/ CDLIIOCCV 


2481 


16382 


A 


2498 


164 


439 


VKSFGASiynRACVCECVYMCVSMCVCVCV 

C VSACL I CVCLCQ / CDLINKCV 


2482 


16383 


A 


2499 


1 


411 


ISTS FYTDTINRTALiHFAVGRNHLSAVD 
FLLKHKARVDVADKDGMI^ALHFATQSNH 
VRIVEYLIQDLHLKDLNQPDEPKSSPLH 
LWINNHI T VWrS LLSAQHD I D I Ii\ IRS 
SKPPLHVAADRGNVKLVE/LLIiKAGCD 


2483 


16384 


A 


2500 


80 


404 


RTAAAVS T VS FP QDFEGQ S P KCTQGVRE 
ALRQIK/RLVPTGSLRH/WPAGSLA/LC 
QPLSDEKDLTQLFMFARNAPTALAMMDY 
PYPTDFLGPLPANPVKGRRRLPRRRERP 


2484 


16385 


A 


2501 


215 


1 


aaiqqgsliacshsvppattpraytpvpp 
qllvrnf\ypktlelrsqlrcarrfpre 
tgadcrhagagrqtk 


2485 


16386 


A 


2502 


423 


214 


WQAQVGGLLEP \GGRGCCELRSCHCTPA 

wvtXgeidsqnnpkrqqnktkqkmgpgv 
pgswgspraggltv 


2486 


16387 


A 


2503 


294 


1 


lhwwqrqrcppppppplppppplppppl 
plrXhriagagqtgakirwgrwrvga 
glgvaaa/dsres cpdgaaagggegggd 

SAATAGPGGGGGGGTK 


2487 


16388 


A 


2504 


331 


411 


PFLKTGNKGAPPPTKIFPLFPKKKGG/W 
PGAVAHACNPSTLGGRGGRITRSGDRER 


2488 


16389 


A 


2505 


407 


195 


GGRGCSELRS CPCTPAWVT\GEVLSQKK 
RKSSNKKSRCLEDQAEVGGERQIiVQVSV 
PSTCQRRVGAWFIYL 


2489 


16390 


A 


2506 


1 


468 


RQGSMNKLETERQ I KKAPARNPERERER 
ERESKKGEDRHTDI CRP/RERERQNYRD 
RQ 


2490 


16391 


A 


2507 


27 


380 


FVCIiCPAPRGGAYRGRQASLS CGGLHP V 

raswllclpnqawamagapppasllpcs 
lisdccasnqrdsvgvgpsepgvgynlm 
vrrflsrsekrnirvgvtrfsrcvchpf 

Li*LGKG 


2491 


16392 


A 


2508 


396 


234 


rqenclnpgggasseprsrlctpawate 
rdsisn/qnkqtnkshilraafsrpscy 


2492 


16393 


A 


2509 


1 


421 


akkikppfplkkkkkpgrggppppppp/ 
akfgretplnpggkgsikpklgpppppp 
ggppnlppkkkkkkaipk/plvpshvks 
tgfpps cs /rsallraftq 


2493 


16394 


A 


2510 


197 


3 


tgppdfnfffflemkshsvaqagvqwrd 
lgslqppspqfkqfyywglqga/lppcl 
nkicifsrggv 


2494 


16395 


A 


2511 


114 


454 


qgplekkgapfknkffktgvlffgsqni 
ppwwfkkgfsrtpfeikkknppgilggs 

P KKFF FFFFFEMVLLHHPGWS ALAQSLL 
TTASDF/LRLKQSSHLSLSS 


2495 


16396 


A 


2512 


3 


495 


FFLSRGLFIHLESAPAIFQCLLFWFLV 
FVATGSHYVPQAGIiKLriASNEPPASNS P 
KCWVYRHEYPAIiKCIiCFGLCLF /AFLP 


2496 


16397 


A 


2513 


25 


437 


PLLFSPPAGHAMEEDILPPAPSFLPYFP 

FYWLVWVKI lYLYVIiAQAATTNYHRLSG 
LNNKHLFLTILEPRMP/SIKVPA 


2497 


16398 


A 


2514 


2 


256 


QAPSAEGEMTSYVLIiAYLTAQPAPTSED 
LTSATNIAKWITKHS\NAQGGFYYlSrQDN 
LPYSLCSGRNESTAFGTNGQDI IHPVHS 
3 


2498 


16399 


A 


2515 


2 


284 


KCQCDELCSYYQS CCTDYTAECKPQVTR 
GDVFTMPEDEYTDYDDGEENNWAIRHEQ 
A\GFTSVISDLQAQI IWISEQKHAMSFV 
EEMTVLEVIL 


2499 


16400 


A 


2516 


2 


432 


RDCERESSRAAE YTASLKAS CWMGDLAE 
SPSLDPQVAMGTPPAGPCGWSVGTRSQP 
FPRGPSRGPCCVSPPSWTPEPGTGQ/CG 
EGGGRWSPQVPP 


2500 


16401 


A 


2517 


401 


2 


ggeaairgqtkpqlegkapecsecekki 
vrs\prlirrqrtptgekpyeceecgks 
fsrryrlaqhqrXtragektyecnecgr 

GFSERSDLXNHYRVHTGERPYKCDECGK 
NFSQNSDLVRHHRAHTGEKPC 


2501 


16402 


A 


2518 


398 


1 


ETFGK\SGGRS I VPGQFLAVGPKGRAVM 
ISAIEKQKLVYIIiNRDAAARLTISSPVB 
AHKASAIiVYHWGVDVGFBNPMFACLBM 
DYEEADNGSTGEAAGNTQQTLTLYELDIi 
GLNHWRKYKEPLEEHGKfCI 


2502 


16403 


A 


2519 


384 


3 


GGGID/SDASLVIAGVRLEDEGRYRCEL 
INGIEDESVALTLSLEGEALPLPPHSCV 
AAGPPRLGLPGLIiPS ISSAPLGTPAPSP 
RPRRPSSPSAPIRWPSPGSPPPPRCGVS 
VPTQPGPVPVQLLRGVY 


2503 


16404 


A 


2520 


1 


426 


GDRQMITAIiIiRKLKQQSRESVEENRPRL 
IiKALKELGDFYLELHWDPQSWVPLLSRI 
LPSDACKIYKQGINIRLDTTLIDFTDMK 
CQRGDLS FI FNGDAAPSESVWLDNEPK 

VYQRLRHEES/QEKNRTQVDCLTP 


2504 


16405 


A 


2521 


2 


425 


AIiPIGRMPIMVRSSR\CVLTGKTPAEFA 
KIiNECPLDPGGYFIVKGVEKVILIQEQL 
S KNRI I VEADRKGAVGASVTSSTHEKKS 
RTMMAVKQGRFYLRHNTLSEDIP r VIIS 
KPMGVESAQEKIQTVSPQEPEWR 


2505 


16406 


A 


2522 


2 


376 


IQSFSWYGYHLCSQGILSERVSASKFP 
LLFFFFFKKGGPPFMP/QGGKTKGPLKP 
GDPQSPGIKGFPCIxAPPNKGKKGGPPPP 
RGNFLGFKKKKRFPQVGGGGSKPPREGK 

PP PLiAPQGGGNRGG 


2506 


16407 


A 


2523 


403 


3 


GAPRSLSEKHRQLMGMINQLSSFREQLL 
HAHYEQKKIiAASQIEKQRQHMKLGKQXQ 
QEQIARQQQRLIQQQHKINLLQQQIQVQ 
GHLPPLMIPVFPRDQRQLDALAQQGFLL 
PPGFSYKAGCSDPYPVQVIPLY 


2507 


16408 


A 


2524 


2 


369 


NARCLILRAAEYMAACVYVCISVFACMC 
FCVRVCVPVCVSGCLCVCVSMNVHLSLC 
/VEQCACLWMCVSLHVCLYICACLCVSV 
CEAVGLHLESHCRGGSRPFPWGPAFSL 
SSCLILAPPSVT 


2508 


16409 


A 


2525 


59 


380 


RNRNFNKNSSFFFFFKKGPWGGGPGGTK 
GPKQRQGEP \NHWGQiQTFPGQTRKPRKP 
TPGPPPPKKIPPFKPLEKKGPGQGPQGG 
QKPGGQKKPPGGAPQKRGKTGGET 


2509 


16410 


A 


2526 


1 


371 


VMPLSRKHPKGFFSNHTLVLHVARSEMD 
KERVFQATRKSSP/CFVPI.LPWPT 


2510 


16411 


A 


2527 


445 


3 


RWLGIiAIiIALKREIiKKGDIiXPEMRWWDS 
SI IPNG\FDIjTEETPKREDYFGIANLVE 
HPAQLNPPVDNDTPVTLGVYLTKKEQKK 
LRRQTRREAQKELQEKVRLGLMPPPEPK 
VRISNLMRVLGTEAVQDPTKVEAHVRAQ 
MAKRQKAHV 


2511 


16412 


A 


2528 


1 


478 


RPTRPKRKLKSHRLQSRQESKKVRVL/T 
NAEDNEMEEETDDGPLIiVPRVKVAEDGS 
IILDEESLTVEVXiRTKGPCWEENDPXF 

ergstttyssfrknyyskpwsnketdmf 

FIxAISMVGTDFSMIGQLFPHRARI E I KN 

kfkrbvyasaiedqatsvht 
SEQ ID NO: of nucleotide sequence 


SEQ ID NO: of peptide sequence 


Method 


SEQ ID NO: in USSN 09/515,126 


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence 


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence 


Amino acid sequence (A=Alanine, 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion) 


2512 


16413 


A 


2529 


385 


1 


QKKCEDLK\DQDNPIVRPPPTPGSCGHG 
GWINTI SKP YWQLNWRGFYYL YGAWGRD 
YSPQHPNKGLYWVAPLNTDGRLiLEYyRL 
CNTLDDFLLYINAREIRITYVQGSGIAV 
NYNNKYVSMYNTVW 


2513 


16414 


A 


2530 


1 


1228 


FRATLRPETMFGQTNCWVRPDMKYIGFE 
TVNGD I F ICTQKAARNMS YQGPTKDNGV 
VPWKELMGEEILGASLSAPLTSYKVIY 
VLPMLTIKEDKGTGWTSVPSDSPDDIA 
ALRDLKKKQALRAKYGIRDDMVLPFEPV 
P VIE I PGFGNLSAVTICDELKI QSQWDR 
EKL2%.EAKEKIYLKGFyEGIMLVDGFKGQ 
KVQDVKKTIQKKMIDAGDALIYM\EPEK 
QVMSRSSDECWALCDQWYIiDYGEENWK 
KQTSQCIiKNLETFCEETRRNFEATLGWL 
QEHACSRTYGLGTHIiPWDEQWLIESLSD 
STIYMAFYTVAHLLQGGNLHGQAES PLG 
IRPQQMTKEWG\YVFFKEAPPPKTQIA 
KEKLDQLiKQEFEFWYPVDLRVSGKDLVP 
NHLSYYLYNHYAMWPEQR 


2514 


16415 


A 


2531 


335 


2 


KKKAIiFFIHPPPFVNGRPHKNPKIKSLG 
SPTPPSLIFLKKKFCPWQDGVRWLYFG 
SLQSLPSRPPPFFCLNLLSSWEYRGLPP 
RPGKYFFPFI PFYFIiVEM\GFTVIiARMY 


2515 


16416 


A 


2532 


3 


380 


AINSYIRGDDPSSYPEWQSASRSSK\W 
SPLPRALHLTDAK 


2516 


16417 


A 


2533 


3 


2083 


SSEGYLRGNMSENEEEEISQQEGSGDYE 
VEEIPFGLEPQSPGFEPQSPEFEPQSPR 
FEPESPGFESRSPGIiVPPSPEFAPRSPE 
SDSQSPEFBSQSPRYEPQSPGYEPRSPG 
YEPRSPGYESESSRYESQNTELKTQSPE 
FEAQS S KFQBGAEMIjUSrPEEKS PLN I S V 
GVHPLDSFTQGFGEQPTGDLPIGPPFEM 
PTGALLSTPQFEMLQNPLGIjTGALRGPG 
RRGGRARGGQGPRPNI CGI CGKSFGRGS 
TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
DLIKHQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHSGQKP YKCPHCGKAFGDS S 
YLLRHQRTHSHERPYSCTECGKCYSQITS 
S LRSHQRVHTGQRP FS CG I CGKS FS QRS 
ALIPHARSHAREKPFKCPECGKRFGQSS 
VliAIHARTHLPGRT YS CPD CGKTFNRS S 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLLQHHRVH\SGERPYKGDDCGKAFS\R 
ASDLIRHQRTH 


2517 


16418 


A 


2534 


434 


3 


APLHSGKRSPTKCN\ECG\GAWNRSSLL 
DRHKI IHSEENPNKCEECGKAFKQASRL 
TIHKI IHAGEKPYKYEECGKVFSQSSHL 
TTQKILHSGENLYKCKECGKACNLFSNL 
TNHKRIHAGEKPYKCKECGRAFNISSNL 
NKQECI 


2518 


16419 


A 


2535 


46 


454 


ps tfs s kvmdkqtlcssqatsntsr yaa 
alyrqgsiypkemktcvhtkpytwhss 
ifmiakkwkqprcpsaXdswrnrmwsih 
amgcyltmekneavmlprr/wslenivl 
sersh 


2519 


16420 


A 


2536 


13 


476 


ALKTYKYSHKKAPNPKKVYGKCSQKDPF 
PINfiCLPTEKLHLCDKIGBGVFGKVFQT 
lADHTPVAIKI lAIEGPDLVNGSHQKTF 
EE ILPE I I I SKELSLLSGEVCNRTBGF I 
GLNSVHCVQGSYPPLLLKAWDHYNSTKG 
SANDRP/DPFKDDQLY 


2520 


16421 


A 


2537 


162 


296 


yVCQRYKIi\NPFLPSYTKINCKWVTDIN 
VKPTSVKLLQEKRRKSIi 


2521 


16422 


A 


2538 


495 


348 


FGW\HAFIVKEPRVEKLCKASARAKPQP 
PAI I AKTFKAGGITGQYAQAAIi 


2522 


16423 


A 


2539 


480 


63 


ARSEAWIS lADAYMASPGDRIAQLLIiLP 
YIKVGNSBIKRTGGFGSTDPTGKAAYWA 
SRVSEIJRPACKAI IQGKQFEGLVDTGAD 
VSI lALNQW/PKNWPKLKAVTGLVSVGT 
ASEVYQSTVILHCLGPDNRDSTSHYSKT 
R 


2523 


16424 


A 


2540 


2 


509 


NVDADDVRLAIQCRADQSFTSPPPRDFL 
nDIARQKNQTPLPLXKPYAGPRIiPPDRY 
CXiTAPNYRLKSLIKKGPWQGRLVPRLSV 
GAVSSKPTTPTIATPQTVSVPNKVATPM 
SVTSQRFTVQIPPSQSTPVKPVPATTAV 
QNVLINPSMIGHIOSrrLI /TTNMVSSQNT 
A 


2524 


16425 


A 


2541 


501 


1 


QWQWEDPGPGPRPREEASSYKCELREQ 
\LPQYVRDFFRKKAE\SGMDSSRNLEKL 
AERFIiAKTCSTKDQQFKKDQNVLSPVKrC 
CHLLLTQVKRESRAHTTLSDIYIiNNIIP 
RFVQVSEDSGSLFKKVQRYFFTEVRGWS 
NDTI FKIIiLD IMLI TWVTQLS VHQTP V 


2525 


16426 


A 


2542 


473 


287 


EKDFNLPPKDLRL/KTSDV/TSTKENEF 
EDYCLKRELLMGI CEMGWEKPS P IQVC 


2526 


16427 


A 


2543 


268 


482 


KKKKAWGLQRGGKNFPAKAELPTHGIQI 
NSCS VRIi/VDIKKEKPFS ILKVEGQAQA 
RTHLNRAFDI IVLTRGG 


2527 


16428 


A 


2544 


407 


1 


ckkicylihywweykmiqplwktvwqfl 
Kkklnilydpavilydiypeely 


2528 


16429 


A 


2545 


28 


399 


FRHSSFQRSGRGSQLMVHFLSL/SVMPK 
IGSVAGINYGLVAPPATTGETXiDVQM/K 
GEADTENH 


2529 


16430 


A 


2546 


2 


365 


FVWVDEVGGEALGRLLWYPWTQRFYE 
SFGDLSTPDAVMGNHKVKMGKKVLGAF 
SDGLALLDNLKGS FATLSELHCDKLHVD 
PENFRLLGNVLVC VXiAHH/ FDKEFAP P 


2530 


16431 


A 


2547 


375 


1 


GFACRRMQKRVREVSHAEAESAASKKVR 
ANGS GKQNEGMNVTWTATIiALSQAVI S S 
ATYTQMQ PHSL I QQQQQIHLQQKQWI Q 
QQIAIHHQQQFQHRQSQLLHTATHLQGA 
\QKQKQQQHEWR 


2531 


16432 


A 


2548 


3 


376 


ELGSDVA\GAEAIiVDRRQERKGE IDAHE 
DSFKSADESGQALLAAGQYASDEVREKIi 
AVLSEERAALLELWELiRRQQYEQCMDLQ 
LPFRDTEQVDNWMSKQEAFLLNEDLGDA 
VDS/ IKEILKKHE 


2532 


16433 


A 


2549 


360 


1 


RGEMLWTVXNNRFLKNPVPGKIEPFKSH 
SLYPPCYVHDVSFWIDQKKGFDELiEFHT 
VGRAVSQDTIISIQFIiSRFQHPKTQQVS 
liCYRIiTCQTCDKALTQQQVASiyiRSQIRK 
EIQQHQY 


2533 


16434 


A 


2550 


2 


403 


WAEEDTELRDLLVQTLENSGVLNRIKA 
ELRAGVFLALEEQEKVENKTPLVNESLK 
KFLNTKDXLQGLEGRESNLARDLGIIBAE 

GTVGGPLLLEVIRRCQQKEKGPTTGEGA 

LjDLSDVHSPPKSPEGKTSAQTT 


2534 


16435 


A 


2551 


1 


409 


VPRNPTPLGGPGGP ILRSREGGPPGLPR 
EPPSVLKKREPPTRGGGRPPGPPLPRRV 
RPEKWVNPGGQTFPSPKLRPRG/HHPGG 
QTKNPFPKKK 


2535 


16436 


A 


2552 


256 


1 


HEMDGTIiLGPFPPPGSSKGPSLTGPPIIi 
FQSPGP/APHTPSSSPANLKT/CTPVCP 
SHLPW/CCPLCLPMRL.PWSPVPPVSKSS 
PPALY 


2536 


16437 


A 


2553 


398 


3 


ARQQQQLLQQQHKI /NWLQQQIQVQGQL 
PPLMI PVFPPDQRPIjAAAAQQGFLLPPG 
FSYKAGCSDPYPVQMIPTTMAGAAAATP 
GLGPLQLQQL YAAQLAAI^Q VS PGGKLPG 
IPQGNLGAAVSPTS IHTDKRCI 


2537 


16438 


A 


2554 


20 


517 


DRPPS TKRRDTPQLRGAPDLS PRGPAP V 
PECPEH/SPRKKTSACRPPLPLRPSHSS 
PLP/SPQPSHSTPQASCPLPEALSPPAP 
FRSPQSYLRAPSWPWPPEEHSSFAPDS 
SAFPPATPRTEFPEAWGTDTPPATHRSS 
WPMPRPSPD 


2538 


16439 


A 


2555 


361 


3 


KSSQEALEEALRQRLEELKKLCLREGEL 
PGKLPVEYRLDPGEDPPIVRRRIGP\AF 
KLDEQKILPKGEEASIiERIiEREFAIQSQ 
ITEADRRLASDPNVSKKLKKQRKTKYIN 
AVKKLQVY 


2539 


16440 


A 


2556 


470 


2 


LKAAVTAGLEVPSDVSDRAFE \WIiSAFP 
Ii\DSPYSIHHPRRIQVSSEKEAAPDAGA 
ERITADSDLAYSSKVLLLSSPGLEELYR 
CCMLF VDDTAE PRE T PEHP VKQ I KFLLG 
RKEEE P VL VGGEWS P SLDGLGPQADPQV 
KVSNAIRCAQAQTGTV 


2540 


16441 


A 


2557 


2 


315 


WAEEDTSLRDLLVQTLENSGVIiNRIKA 
ELRAAVFLALEEQEKVENKTPLVNESLK 
KFLNTKDGKMFSLFLFIYL\EPL3SrLDTI 
WEKSGISGLFFVFNCYRSSF 


2541 


16442 


A 


2558 


372 


3 


CNISSHFANKKGQDVIVAARNVMTSQIH 
HAV\KI IPGFNINVPGLPPPDEDTELEV 
QKVSNPQYHEVMNLELENTLDQHSYSLP 

TCRISEYVKKLMELAYHSLLEAASSSDQ 

CADQLFYSVRCI 


2542 


16443 


A 


2559 


64 


435 


WGDAS CTGRAQLGI AHKSVLPTLTDKFN 
TIPAKTPMSFPKEPDKIjTPKFT/WKNRT 
PRTVK 


2543 


16444 


A 


2560 


113 


375 


VPGPARDSTQRRAVKNDKKBLMS IPGI P 
GPVQVPGGGLST\6MRGFPGSSLGFLTI 
PGGALIPFSPAFFSRVQGDLSPRNTGPG 
QKPG 


2544 


16445 


A 


2561 


431 


51 


RKIYRVYERENFRVEIMFRFSHTSKKVC 
KGlSrv/DVAKFIKIiHRDGWHVIiOTQCACH 
QKGGIYWFRSIHVELIGYPPPRSSSHIK 
IGDKVRVKASVTTPKYKWGSVTHQSVGV 
VKGWI I WVXKFLMIiTFH 


2545 


16446 


A 


2562 


1 


455 


PEGIALEWTESTGKEREHTFQPGDNVE 
VCEGELINLQGKILSVDGNKITIMHKHE 
DLKDMLEFPAQELRKYFKiyiGDHVKVlAG 
RFEGDTGLIVRVEENFVILFSGIiAMHGIi 
KVLP P PLQLCS ETAS /SIN/ VGGQHEW 


2546 


16447 


A 


2563 


407 


1 


KWDHPLiSKCEVPCGGS I IS SNGPVSYPG 
FPSPY\TSSQDCVWMITVPSGHGVRLNL 
SliLQTEPSGDFIAIWDGPQQTAPRLGVF 
LRSMVKKSVQSSS I QVLLKFHRDAATGG 
IFAIAXiSAYPKTKCPPPTIHPCI 


2547 


16448 


A 


2564 


382 


1 


TQEVEVAVSLDRAPAFQSGCQSETLSQN 
SNSSNKTWILDHPGIjSDFLDQRPSACTV 

ftpreqvsshtrmf i aalft i aktwsqp 
/kyppvigwikkXmwhiytmeyyaaikk 
defms fagtwmkleamy 


2548 


16449 


A 


2565 


429 


0 


plqsscqtschqncsrbctslhfpgdvla 

TPEQVGGS PAQVP I P YLDDDI PLLEVEQ 
EPVSIiELGDVSIiVSVSREGLQPASITGS 
RGHLIVQIiQELLHHWVLWSAVKSRWVIV 
GLFVS riiILSLVFAIRLiR\SASRAPVLL 
RP 


2549 


16450 


A 


2566 


464 


1 


VIHGVI \NPFVHGDQYKKKFPLK\FYQE 
I YES PFVTETGEYYKQEASNLLQESNCS 
QYMEKVLGRLKDEEIRGQKYLHPSSYTK 
VIHECHQRMVADHIiQFrjHAECHNI IRQE 
KKNDMADTMPVLLRAVS KGLLHMIQEHVL 
RALGRIPTSYVRMQS 


2550 


16451 


A 


2567 


3 


178 


YNPGGGVCSDLI S SHCTPVWVT \ SETLS 
QKKKKKKKKKKNFLPETRERALPGICKKG 
GGG 


2551 


16452 


A 


2568 


215 


411 


IHLIFIHLVFIDYLSVRHSSKILGYISE 
QIRKRNS /WLrYWAHTCNPSTLGGGGGQ 
ITRSGVQDQPG 


2552 


16453 


A 


2569 


170 


430 


TSSQLAPQCLAV/VSGPLAACPEIiTSAT 
SPWLQVRTNAMASPLLKFSAEDLLFKAA 
LSQFCVIMIjNAKLSVQKYEKXLISAFSD 

SR 


2553 


16454 


A 


2570 


3 


463 


CRFFGYSTAAAPMTPSSGGSTLSGITAP 
AVPNIPSPIGVNGFTGIiPPQANGQPAAE 
AVFANGMHP YPAQS PTVADPLQQAYAGV 
QQYAGPAAYPAAYGQISQAFPQPPAMIP 
QQQREWPEGCNLS1YHLPQEFGDS\EX.M 
QMVLPFGNVISSKV 


2554 


16455 


A 


2571 


3 


424 


LKTMKGGTGNGLEIMLDIQQDEYLPVWG 
ETGTS PTSGAPLHGS RPQP / PARGFLGF 
TVRPG 


2555 


16456 


A 


2572 


1667 


2046 


YI PFFTAFLWAAIiTFQVTTTLAPLALLV 
RSAKMMRASHDKPTANITIiN\GKTGRAS 
KQRREERIJRQEVKLSLFTDDMFLYLESP 
I VLAPKLLLIiINFS KVLAYKINVQKLIiA 
VLYIKSSRESNEEHN 


2556 


16457 


A 


2573 


1 


399 


FAISQDHPALPSRPPSIiHHPKPGTLTFH 
PDLPHQATCSRPIRHQRTWPEDAPLAKA 
DTVSPAEHPPAAATKAP/TR/PAPDKPG 
GTSDPQTGPAP\PASPPCSGP\APQPVP 
RKPSRAAPSKVSVTVPRRVPRTFPP 


2557 


16458 


A 


2574 


2 


452 


AKVWEMKSPMRKGHTLIiKNKEEKLNQLE 
SSLWEEASDEGSLGGSPTKKAVTFDLSD 
MDSLSSESSESFSPPHLDSTPSLTSRKI 
HGLSHSLRQ I SSQMSS VLS ILDSLWPQS 
PTPLLASMPAQLPP\RDPKSTPTP/Syc 
GRCRGFSLTS 


2558 


16459 


A 


2575 


2 


369 


TLVYPArTFILLSICICYWIVTAVPLTT 
SGVPVYKVIAPGGHCIHENQTCDPEIFN 
TPEIAKACPGALCNFAFYGEKSLYHQYI 
PTFHVYNLFVFLWL INFVI ALGQCALA\ 
GAFATYYWA 


2559 


16460 


A 


2576 


406 


1 


RQEGTGPSYLLLVGITPRSFWGFSPPLG 
PSGKNP IKTFGGPLQGFFFRVQMGVLLP 
LPGPSGN/CSIKVSAUSISSFSPSGVNPG 
EASLPWF/CFFEMESRSVAQAGVQWQPP 
PPGFKRFSCLSLLGSWGYRRPPLiV 


2560 


16461 


A 


2577 


3 


410 


YISPF YITHMRAHTNLPGP F/ KUSTQRAD 
ALVSAAFADAQTFHSLTHLNAAGLRKRY 
GLSWKQVKEIVKHFSAGEVLHLPHQGAG 
VKTPRGLS PNS I WQMDITHI PTFGKLS FV 
HVSVDIYSHFIWATYQTGBATAHVK 


2561 


16462 


A 


2578 


386 


1 


TERIRQRQYQRETEKDKRYTERQRECTER 

iresdrdrsrqnqierdrrat\rerdre 
kqresdretyrerewqieterdrkrqsd 

RDRETQRETGRIRYRERERQHQTESDRD 
RERESDRETQRHRECI 


2562 


16463 


A 


2579 


2 


432 


llyfakrygaafgeccqaadkaacllpk 
ldelrdegkassakqrlkcaslqkfger 

AFKAWAAARIjSQRFPKA*FAEVSKLVTD 
LT*VHTECCHGDLIjECADDRSDIAMYI c 
EMQDSNFSKLNECCEKPLLEKFHCIAEV 
END 


2563 


16464 


A 


2580 


3 


426 


MLLNDALAIREKTLGKDHPAGAATLNNL 
AVLYGKRGKYKEAKPMSKRALE IRKKVL 
GKDHPDVDKQhNNhALIjCQNQGKTEQGE 
YYYPRALHIYHTKLGPDDPNAANTKNDL 
VACYLICHGTViCQA* SLNKENLTS AHE WD 
V 


2564 


16465 


A 


2581 


10 


389 


KLAG * GL WGHTL I TNPLTE PLT YPFLGL 
YL.*SIIITSSICIiLQTDIj*ALIAYSSII 
HIALVITAXLIQTP*SFTGAVIXiIIAHG 
LTSSLLFCLTNSNYERTHSRI I ILFQGL 
QTLRPLIAL* *LLA 


2565 


16466 


A 


2582 


1 


192 


LIPTLAI ITR^GGQPERLNAGTYFLFYT 
LVGSLPLLIALIYTHNTLGSLNILLLTL 
TAQELSNS*ANNLI *LAYTIAFIVKIPL 
YGLHL * LP KAHVEAP I AGS I VLAAVLLK 
LGGYGIIRLTLILNPLTKHIAYPFLGLS 
L*GGQPERLNAGTYFLFYTLVGSLPLLI 
AIiIYTHNTLGSLNILIiLTLTAQELSNS 


2566 


16467 


A 


2583 


2 


438 


QAHGPLAGCRLRPRSPSPVLWRRRRRWW 
RQRRKWKTKTATAAAGMYASWTKACRGIi 
EELINLTRLNVSYNHIDDLSGLIPLHGI 
KHKLRYIDI/HSNRIDSIHHLL* CMTOLH 

FLTNLILEKDGDDNPVCRLPGYRASIFQ 
TFAQL 


2567 


16468 


A 


2584 


3 


415 


GRAILLLEEAIQYLSGIEIDLIDTDRGE 
WDSLTPQARR* KEAGLQMFVQIiARFHNI 
VCKBAFGTIiAPLTSE IKSLFGHPPLAER 
1 1 SMLNYFLQHLIGPKMGALEVKDFSBF 
DF*PQQHVSDSGAiyLDLGDEENPC 


2568 


16469 


A 


2585 


2 


453 


DAKMYLSYKYATVIRNLREGTCPEAPSD 
ECKPEKRCALSHHQRLKCDEWSDISVGK 
IKCVSAETTEYCIA* IMNG*ADAMSLDG 
GFVYISGKCGLMPVLAENYNKSDNCEDT 
PEAGYFAVAWKKSASDLTWDNLKGKKS 
CHTAVGRNAC 


2569 


16470 


A 


2586 


3 


413 


MAVESRVTQEEIGKEP*KPIDR*KTCPI, 
MLRVFTTNNGRHHRMDEySR*NVPSSEL 
QiyTWMDATLKELTSLVREVYP*ARKKG 
THFNFAIDFTDVKRPGYRVKBIGSTMSG 
RKGTDDSMTLQSQKFQIGDYLDIAI 


2570 


16471 


A 


2587 


1 


798 


liEVMLMLVKAGADQRAKNQDGMNALHFA 
TQSNHVRIVEYIiIQDLHLKDLNQPDEKG 
RKPFLIaAAERGHVEMI EKLTFLNLHTSE 
KDKGGNTALHLAAKHGHS PAVQVLIiAQW 
QDINEMNELNISSLQIATRNGHASLVNF 
LLSENVDLHQKAEPKESPIiHLVDINNHI 
TVVNSLLSAQHDIDILNQKQQTPLHVAA 
DRGNVELiVETLl4KAGCDLKAVDKQG*TA 
LAVASRSNHSLWDML I KAERYYAWREE 
HHESIRDPSTGFTIi 


2571 


16472 


A 


2588 


2 


285 


AWS lAPSHPHSKVPPGPRRGKAEGRPGA 
AAQAAEQAEVHPPS SGP S PLPARQPP VW 
QIPPTPSLKTTTRRGAQPQHSRKRLA*S 
RSVSVLFRKM 


2572 


16473 


A 


2589 


11 


438 


AYYGIjNWHLGATIjSQKKKKKKKKKKNFP 
RGGGPPRNPHFWGIGGRPGTGPPRGEKN 
GLKNQKKEKFLAPR * KKS TGKPLKP PQG 
KGWGGGPKFPKKKRPPRAGGGSLKPKGK 
NREPFLALiKSItPGVFLGGFGGNPGiyiGLK 
NQ 


2573 


16474 


A 


2590 


313 


391 


VHLVRVKLGL * SRI YKELLQLNNTT * DN 
PI ^KWATDLDRHFSTKYTQMTORLMEKC 

STS 


2574 


16475 


A 


2591 


62 


616 


EVHQGTSVRDSEVRRRPQARGPLMPAER 
AGRQRWLVPALQPRRGGLRR*RGAVRQH 
GAHPHGLLLQDQKI PALPGRKQAGSLHA 
PGTEGEPDHGGDPVXiDAGIQHHRQQRHP 
TADHLNPGEHRRGEAHVRAAV* PAAGAE 
GAAKERRAHQANTAIiQVHRR* LGSFAEL 
RLLRKPGRTS VWP S PM 


2575 


16476 


A 


2592 


345 


438 


HKRWLPVPILYQHLF*VFGHPEVYILIL 
PGF 


2576 


16477 


A 


2593 


479 


313 


QDGLDLIiTS*STRFSLPKCWDYRREPPIi 
KLINLQSSGVGLHVQLFPPSFCFDQLIi 


2577 


16478 


A 


2594 


3 


419 


LTLHSNTLPPPEMSGLT * PPATNAYWTF 
LPSAIRLPPEIPFIAVLLSVSLPDETET 
LSDAHSWRLTEKYERDMYHIiLMSAQER 
LERLFGLHGGTIP I VPTADFHDSISGAS 
DTDIAHSGLiAYTMERSARQIMRTAMKY 


2578 


16479 


A 


2595 


1 


419 


HKCEGRYRGKTYGGYWSLCT^TVNKALDE 
RIPITSASYYATVTLDHVRNILGSDTDV 
SMPLL*ERHRILNETGKFLVDKFGGSPI 
NCVR*RDNIAHKLMHIjWESPPSYRDVT 
riFEGN*VAFYTRAQIIjVTDTGTVIiEGK 


2579 


16480 


A 


2596 


199 


397 


SPTALNTDDVNPFMGSSSRGHHRQFQTG 
I VSDHPAEAGPIS *DEQELPYAVLHFHK 
VQPLEPKVTD 


2580 


16481 


A 


2597 


154 


3 


MLSLRGFFCLFVCFETGS*YAAQAGLKL 
LGSGSPPISSFQSARITGVSHL 


2581 


16482 


A 


2598 


425 


223 


QITGHGGKCL*SQLIiGKFGQENFLNLGS 
RSFN* PKLLPGTSGWVTMLNSLGKKKKV 














TAYLNKWMPTP 


2582 


16483 


A 


2599 


3 


441 


QGPSRDLVSELYQINAFDTPESLLMIGK 
DHSDPIHHTFDHMWRTKEHNKAGWLLLR 
SVDKVMKENDELGDSISQLQKQILSLKY 
AKIAIjTERLISCPV*TEIVLNYT*SI»IM 
LLTDLL*KVHAQSHHAHSGQASNCNVMT 
IiIGLQH 


2583 


16484 


A 


2600 


131 


423 


GPAPMVILFHLTGFLiDAFLPLSHbLTSY 
LVPW I LSGTDGHTFRSACLPRWIiEAEW I 
FGGVKYQYGGNQEGK* CFFTG* SYVYNG 
SSGKVPWETFSRT 


2584 


16485 


A 


2601 


222 


447 


SGI PS FGL* WEQNRPGRLNAQ I LKDLG 
VSPGSDYGKLKNGI SSGLDNGVTTSDHD 
VLKMAIVGRKIWILGDWSG 


2585 


16486 


A 


2602 


2 


457 


FICDPLIKAIGTEGDTDVLSEIMISrSFAK 
SIEVMGDGCLNDEHLEELGGILKAKLEG 
HFKNQELRQVKRQEDNYDQQVEMSLQDE 
DECDVYILTKVSDILHSLFSTYKEQILP 
WFEQIiLPIiIVNLICSSRPWPY*QWGLCI 
FDDIVKHCSPSL 


2586 


16487 


A 


2603 


1 


297 


DHRQKLYAHTECGKALLWKS VI CVHQKI 
LEEEKPCEGTKYDNIFSNRGCPPVPRMV 
HAVEIPCK*TECEKATGVHGPRGASEFL 
PERPTGMNMAKCESR 


2587 


16488 


A 


2604 


1 


405 


RFDVSDGLELRPKYNGIAHRLTTIWYLD 
GLRGLYKGVTPI IWSAGLSVJGLYFVFYN 
AIKSYITEGSSERLKASEYLASTAEPGP 
MTLCITNPLWVTKTGLMLQYDAWNSPH 
* QYKGMSDTLVKI YKHGGVQGL Y 


2588 


16489 


A 


2605 


165 


3 


PEELLLVFFPVriKNTGIRPGAVAHTCNP 
STLGGRITRTGD*DHPGQHGETLSL 


2589 


16490 


A 


2606 


3 


455 


KRYGCFSKRMNKRSATNVFFCARKGEVIi 
GLLGHNGAGKS SSI KVITGDT * PTAGQV 
LLKGSGGWDALEFLGYCPHENALWPNLT 
VRQHL* VYAAVKGLRKGDAEVAI TRLVD 
ALKLHDQLNSPVKTLSQGLKIKLGFCIiS 
ILGKPSCGILA 


2590 


16491 


A 


2607 


1 


429 


VDYTVRKFCIQQEGDMTNRKPQRLiITQF 
HFTS WPDLGVP I TP IGMLKI LQKEKACN 
PQYAGArVDHCKA*VGRTGTF6DIEAiyiL 
DMMHT*RKVDVNGFESRNRAQSCQKVQT 
DMQYVFIYQALLEHYIiYGDTELEVTCLE 
TPW 


2591 


16492 


A 


2608 


166 


435 


KFLSNNYVHFKQNFKKVLKFIKHIiVLNY 
FKNIVLGQV*RATPGIPALWEAETGESIj 
SPRSSRPTWAMWRlTSISTKNAQIKNIVIi 
RILDPG 


2592 


16493 


A 


2609 


228 


431 


I FSKI CI FNGFNYFP I VGHIGRI YLF IE 
TGSHCVTQAGVQWCDYGSLQP*TPGLK* 
SSHLRIiPSSWDH 


2593 


16494 


A 


2610 


439 


175 


RNGGLHLWSQLLRRLRWDYHFS PGSQGC 
SKS*SYHCTPAWVTQGDSSLKKQKRKIS 
TYTS ILSVLNETVYI I VYYTKKLDTSFK 
RKKL 


2594 


16495 


A 


2611 


1 


439 


LLGS IIjSSMQKPRGLVDQETLRKAR*QA 
ARLNKLQEHEKQQKVEFRIRMEKSVSHV 
V:*DSGHITT*VQSMNSIERSIIjHEWEV 
DGLTSFYFGEDDDCRyGMIFTKDA2\PSD 

EELDSYRRAEEWDPHMADDKRKLTSIAQ 

RPDDDA 


2595 


16496 


A 


2612 


1 


439 


VIRKVTGTEGSSSTVLDYTIPSSTGGMP 
VRKSEDQTDTKRTVIKTMEDYlilNDNTAP 
AEDVI IMIQVPQS I WDQDDPESEDQDDK 
STQPISSVGKPATVIKNVRTKPSAIVKY 
PEKESEPFAKILKFTMDVSHEVI PHEVK 
SS*YSA 


2596 


16497 


A 


2613 


2 


453 


EKPEKEECNLWTEMWQENVPGSFGGIRL 
YU3ELMTITQKALHSQPWKMKAQGAIAM 
ASMALQTNSLVPPYLGMILTALVEGLAG 
RTWALKEELLKAIACVETARSAEIi*KSV 
PNQPSTHEI IQADLKECSKENGPYKI* P 
VICAADILHA 


2597 


16498 


A 


2614 


2 


441 


LEPALPGRWGGRSAESPPSGSVRKTRQN 
KQKTPGNGDGGSTS * APQPPRKKRARAD 
PTVESEEAFKNIMEVKVKIPEELKPWLD 
EDWDLVTRQKQLFQLiPAKKNVDAIL*EY 
ANCKKSHGNVDNTEYAVNEDVAGIKEYF 
NEMLGT 


2598 


16499 


A 


2615 


1 


430 


RGDRDLHCTESQSE AS TEEGHDS LS VG I 
FEEDSQLEFILDPPKSKPPAWLNGIMTC 
*DFELLNPRRVRFLLEVKDLALTRRQIL 
LHKGLSDYEKSTTIiQSIiVLKKSSRSGPP 
LS lEDLGLiNFQL* PSSRVYGFTAEELKP 
SGE 


2599 


16500 


A 


2616 


373 


552 


ICKIKYKLiFNSALLFFRQGIiALLPRMEC 
S GAI KVHCS LDPLGS SNPL * RKI KESTL 
NLEKSLCTRGIFLCKYEEVPKQFLKICF 
Q I FLETGSRS VS QAE VQWHDQNS LQPRT 
PGLKS FS CLNLPKCWDYRCE PPRRALCS 


2600 


16501 


A 


2617 


498 


157 


QLIGSSTHQAALjRSRSLPPPAGPGTFHF 
HYQGKAW WKGGTS PDSLPSLLGRGVS V 
QIjHPRGKEQRGASDT*HKCPVKLWTIGG 
KYRVSETSRIFSLPPTTLQRAGLDSGSG 
Sli 


2601 


16502 


A 


2618 


255 


389 


LSEFYTYEGPSIRPPIGS*GTNLPLPLS 
YIPRSPSAVDDENLLDE 


2602 


16503 


A 


2619 


3 


207 


QHSSLIIIRAIAIKLGIAPFHF*VP*VA 

QGTPLTSGLLLLT*HKLAPISIIYQISP 

SLNVSLLLSGT 


2603 


16504 


A 


2620 


3 


207 


Q YSSLI I IMGVGIICLGIAPPHF* VPEVA 
QGTPLTSGLrjLYT*QTriTPISMIYQISA 
SLNVSLLLSGT 


2604 


16505 


A 


2621 


45 


447 


WRRIDCRIjNLCVTTASLKHFFS IAHKFH 
LHNGSHLQSQHSCEAQLRRTARFI XCCL 
PYGESGQSWPTLTLERANLNWLTVYPNN 
WKDWEYDSDQYMSNPDRSSEIVUyiNI I VW* 
RSIDNLPHIICSTIiIIJTKNSFL 


2605 


16506 


A 


2622 


3 


142 


GNQATPKTAPATMSTPTILVATAVHAYR 
* * VAEKEHPIiKFGGRACS 


2606 


16507 


A 


2623 


2 


453 


HGPGGLLDYIDKERIRDFLNGECMCEVP 
DGGLVPKSLYRTA*BLENEDLKLWTDTI 
YQS AS VFKGAPHE I L I Q I VDAS T VI TWD 
YHVCKGDIVFNIYHSKRSPLPPNKDYLI 
AHSITFPDGlSnSfVHLimKVWMLGPDYIMW 
ITTLITNNEN 


2607 


16508 


A 


2624 


1 


315 


EEKQAPGFTTGRGKIiTPLFCAHAVRFI I 
S TSL I YKGAYP * ALKGKEKY * LAVFWVY 
NKKGLlSniJNNLFLDGVHLCFVPEV^ 
GKGLPFS IGLYHLILLLFFFF 


2608 


16509 


A 


2625 


232 


478 


LDCGNYFTV*MYIKTSHCIP* lYIILNF 
HLYNFRYNIiGDYWGBIVSEVMAQRQPMK 
PTYAIPIIPITNSSQFKHQEAMDVKE 


2609 


16510 


A 


2626 


3 


430 


TSTPNVHMGSTSLPGDSTTIEDAIQSHS 
SSASPSAIiSSHPmiiSPTGWSQPKTPVP 
AQRERAPVSGTPDRDKIjRPCGQRD*GYY 
WEIEASEVMLATRIVSGSFGTVYQGKWH 
GDVAIAILiQWDSTPEQFQGFRNDVAGLi 
RI 


2610 


16511 


A 


2627 


2 


341 


AliQKHEDTDCPCWVSCPHKCSVQTIiLR 
SE*SAHLSERVIAPSTCSFKRYGCVFKG 
T YQQ I KAHEASF WQHVNLLKEWSNSLK 
KKGSSFDKECVDKNKSIHCWHIQICSSV 
I 


2611 


16512 


A 


2628 


2 


179 


RHTGLWVTSLPAVFPGQVRRTLF ITGLP 
RDARKETVESHFR*AHWLVGDFSPCCIS 
RTGEADPVHHRTPQRCQEGDCGEPLPVS 
RW VGS FL I H Y I QEHCL 


2612 


16513 


A 


2629 


1 


418 


GFSFCHPVPVKWRHRDSPQP*TPGLK*S 
CLGLPKCWDYRHEPLRPANISY 


2613 


16514 


A 


2630 


3 


442 


FTCGTI I lAIPNGVKEFT * LATLHGSNM 
K* SAAVL * ALGYI PLFTVGGLTG I VLSN 
SSLDIVLHDTY YMVAHFHYALS IRAVFA 
IIEGFIH*FPLFSGYTLHQAYA*IHFTI 
IFIGVNLTLFPQHFVGIjFGMPRQYSDYP 
DAYTTR 


2614 


16515 


A 


2631 


2 


454 


AAPMELI CWSGGWGLPSVDLDSLAVLTY 
ARFTGAPLiKVHKI I NPW* S PSGTLPALR 
TSHGEVISVPHKI ITHLLKEKYNADYDL 
SARQGADTLAFMSLLEEKLLPGLVHTFW 
IDTKNYLEVTRKWYAEAMPFSLNFFLPG 
RMQRQYiyi*RIjP 


2615 


16516 


A 


2632 


131 


410 


WMWSSKAPHCFRLPSIGDADTVHQCAMS 
FQKGHSALSGVLHLVFKPDLVYQTLLQM 
PPRKCCLWPGAVTHACNSMTLGGRSR* p 
TRSGVQDQP 


2616 


16517 


A 


2633 


2 


230 


FFSETSSLIiEIQGIARHGGTWRIiRQEWN 
DNPGGGGCSELRSCSCAPAWVTVRIjDLR 
KKRWRZSrP*KILKNYLKrPVI 


2617 


16518 


A 


2634 


2 


370 


GTSSSDPAQPGDDKEFIDASRLVYDGIR 
DIRKAVUyilRTPEELDDSDFETEDFDVR 
SRTSVQTEDDQLIAGQSARAIMAQLPQE 
QKAKIAEQVASFQEE*SKIjDAEVSKWDD 
SGNDIIVLAKQ 


2618 


16519 


A 


2635 


2 


376 


MTDTDALY*RELFDPADKDKMDHSRRGI 
AliVFNHERFFWHIiSLPERRCTCSDRDNI, 
TRRFSDLGPEGKCPNDLKAEELLLKIHE 
VSTVIHSDADCFVCVFLSHGERUHIYAY 
DAKIEIQTLTALF 


2619 


16520 


A 


2636 


1 


461 


DMAFLILTERKILGYGQGRIGPNVACPY 
GLLQPFGDAI *LPTKEPLKPVTSTITLY 
I TAPTLALT I ALLL* TPLPI PNPLGNLN 
LGLLF I LATS SIAVYS IL* SR* ASHSNY 
ALiIGALRAVAQTISYEVTLTI II^LSTLIj 
IKGSFNLRSLITT


2620 


16521 


A 


2637 


2 


384 


EnSJFKAIiAMIAFGQYRQKRPFEDHVKLAN 
D*IjNFAKTCVADESAENCDKSI:iHTLFGD 
KLFP I ATUiETYLELADCCAKQ* PD IHE 
CFLQLKYDNPYIiPRtiVKPITODVMCTAFH 
DTEETFLK*YLYEIA 


2621 


16522 


A 


2638 


1 


373 


TFIYLLFLFSSAYSRGVFGRDAHKSEVA 
HRFKDLGDENFKALVL lAFAQ YLQQCPF 
EDHVKIiANEVTEFAKTCVADESPEN*DK 
SLYTLFGDKLCTVATLRETYGEMZ^CCA 
KQEPDRNECFLQ 


2622 


16523 


A 


2639 


2 


375 


KGPCYRLVSTGTQRRHPGAVYLNKHLC* 
CNVGKA*GPHCEKCTLPCTFNEEPLKAIj 
TPFREHGP*VSDPEVATAPTEKEIPSLD 
QETTKLEPGQPQLSPGISTIHIiHPQFPV 
VIEKTSPPVPVEI 


2623 


16524 


A 


2640 


215 


478 


KYFLASHTSLFI I CYTAHLTCTIAEPKQ 
IESHFGKRLDADLVF*KSDDSTVVDVIP 
SKPV*TSGLFSGKCL*HIVEGIIRAVDP 

RKLY 


2624 


16525 


A 


2641 


208 


376 


ILRNLIKNHFWPGWAHACDPNTLEAM* 
GQITRSRDWDHPDRHGEAPSVLRIHRIiA 


2625 


16526 


A 


2642 


424 


2 


KGE P L P PLGGWKT * GPRAF P QAQI PHAG 
FKTRGCPFPLPQGRNKAQVPNPFCPRER 
FLPGKGSGRVAPLKNGAQLGGPPFYPHP 
FGGPSPRVPLGFGVQTHLGNKPKPPFFP 
KKKKGGTGCSVGNWGPKSSQTIjPSPQAS 


2626 


16527 


A 


2643 


190 


3 


ELS TRGFQGPHRR I AGL KQ CLVMFPELE 
RSGGLSEH*HLHLPGSSDSRASAS* IAG 
ITSVCH 


2627 


16528


A 


2644 


2 


339 


CCEKPLLEKSHCIAEVENDEMPADLPSL 
AADFVESKDVRRNYAEAKDVFLGMFLYE 
YARRHPDYSWLLLRLAKTY*TTLEKCC 
AAADPHE C YAKE FHEFKPIi VEE PQNL I K 


2628 


16529 


A 


2645 


81 


369 


VEVTGQPQNAS FVKRNRWWLLPLI AAIiA 
SGSFWVFRTPDG*TSWR* ihnhmtqli 
NNHLRA* YS VYRD I YFLW I AI * * S S * PA 
SVLLFTADYCPE 


2629 


16530 


A 


2646 


1 


348 


DMDMNPLRPQNYRFG*ELKADIDYHFKE 
DNDEDDHQIiSLRTVSIiWAGAKDD*HIVE 
SEAMNYEGCPIKVTIiATLKMSVQATVTL 
GGFEITPPGDLRMKCGSGPVHIMGIiHLV 
AGEE 


2630 


16531 


A 


2647 


1 


178 


GYTDTIIiDVRSQRVRSLLGLSNSEPNGS 
VETKNIi * QILNGGES PKQKGQE I KRYDL 
YLT 


2631 


16532 


A 


2648 


181 


244 


TIKRYKNVMIFYYFNFF*R*GLNSIjAQA 
AAQWCaSTHGSLQN-QPPGLKGSSCLSIiPCS 
*GyTIFYLSIHQLIDrWIVSTFSKLtFLH 
VSAYSSIKMS 


2632 


16533 


A 


2649 


2 


369 


KWITFISLLFLFRSVYGKFBFLLDAPKNr 
DVAHRFIDFGEQHFKALAE I AFDHYFLH 
CPFEDIiVTIiWE*TEFA*TCIiADESA*N 
CDKSLHTIiFGDKLCTVATLR*TYGEMAD 

CCAKLEPERI 


2633 


16534 


A 


2650 


1 


349 


VTFISLLFLFSSGYSRGVFRLDAHKSEV 
AHRFKDLGSENFKALVLIAFAHYLQQCP 
FEDHVKLVNEVTEFA* TCVADESAENCD 
RSLHTIiFODTLCTVATIiRETYGETADCC 
AKQD 


2634 


16535 


A 


2651 


21 


403 


CVCCVCVRESRTESLPGDNEDFNVKNAS 
VKDVRCVHFDCDSQNDPPMEATGPTAQV 
TI YSLINLLHRLKYFETLDLH* lAIFFS 
KVS ILNHPGQI lADYAPALDCHTCHIAC 
KYAELKEMIDRRSVK 


2635 


16536 


A 


2652 


1 


355 


ARMS ITDXYGQHBIAGGLMTQEDVSE I K 
SSYYAKSNDPJIiimMAHYRPTALlSrCiQAHW 
HGIiAQAEAQITTWSSGVPFDLLLMVGMM 
TVQVP * ELQMHSHLLKTHDQSRMENMMY 
GIKLDW 


2636 


16537 


A 


2653 


2 


360 


LFBHIiG*YKFQNAIjrjIiRyTMKVPQVLTP 
TIiVKVSRNL*KVGIKCCKHPEAKRMPCA 
QDYLSWIiNQLCVIiHEKTPVSDRVAKCC 
TESLVNRRPCFSALEVDETYVPKEFNAE 
TFTFH2UD 


2637 


16538 


A 


2654 


360 


314 


YTCAIVPCKLFWWCCRDRVSLCCLyWSQ 
TPELK*SSCLSFPKC*NYGHEHTWPARI 
IFLNLYNKSG*TATRKCPPTKVTPSSHQ 

Q^HCTWPYFYFK 


2638 


16539 


A 


2655 


1 


353 


NWTL * TL.KKI FSLKDAIKRT KI QSVRWK 
KI FAKHTSDKG VES KYI * RTL INQ* ENN 
S I KNWS KNTLNRRFTKD IQKANKHMLSAT 
SLVH* ES KLK* Q * YHYTS IRVAKMKKTD 
YVTSW 


2639 


16540 


A 


2656 


340 


10 


GREDKSEKCYLKPGRSQPDRGAPKSSPG 
VP*PPAPPALPGPGRSSPFPQGSLiAGKD 
LRPRQPSQPGGPGELIFPVKTKKKKIKE 
RKTLLAWTRSNERPEIAQREGGCLRQ 


2640 


16541 


A 


2657 


509 


261 


IPYFKIHCLHSALGVAETEKETAEHLDL 
AGASSRPKDSQRNSPFQIPPPSPDSKKK 
SRGIMKLFGK*VKQ*TSGMGPVLHVA 


2641 


16542 


A 


2658 


345 


3 


SAHLSHPKCWDYRREPLHLARISFLFKA 
SSPLYGRATFYLSTPPLMDGRWVASIFW 
LL* IVLP * AWVy KYLSETPLS I F*KTGS 
LSVAQAAVQWHNHGSLQPQTPGLKRSSH 
L 


2642 


16543 


A 


2659 


271 


56 


VILYIVMPIPRPCPHPHPYIHTHTHTHT 
HTHTHTRLGKREF*IiYSW*LPGYSTNGN 
KIFIiEIYLtiPILLKM 


2643 


16544 


A 


2660 


366 


68 


PGQQSKIMPQTERKKGRKEEREEREEKE 
EREGRKEGRKNEGTETVIEGGSSKTQEQ 
TRIKAS *LiNSLYTTVKPQRRLKGGKKHP 
KDSNFKD*RNISPHR 


2644 


16545 


A 


2661 


351 


199 


liLRRLRQENLLNPEGRGFSEPRSHYYS P 
DWATE * DFWSNKI K* QKVQYSKY 


2645 


16546 


A 


2662 


1


160 


ERAWLHLSPGDGVCSEPRSHHCTPTWVT 
E*DSVAHQPGWDRVRLCLKPTWVTE 


2646 


16547


A 


2663 


233 


3 


GRDNWLKTVAKPESLSGKRGPSLIiRKVP 
NPFPVYILFLPQAPKGKWLSPFPPFFFF 
FFFLRGSLAM* PGCSAVAQS 


2647 


16548 


A 


2664 


96 


349 


QMPSSVLTRVANAYYSGFFFWFLKQKFC 
FVTQAEVQGRS IS * PGPLVPGFKRFLGL 
TLLSTWNCRRAPPGLVIACGFKMSLLTL 


2648 


16549 


A 


2665 


322 


74 


DTYTLPRLNQVESIiNRPITGSEIEAIIN 
RPPTKKSPGPDRFKAKFYQRYKEELVSF 
LDKLFQ* lEKGGIGWARWLMPVr PVL 


2649 


16550 


A 


2666 


2 


228 


NLALSPRLECKSQGFTMLIRLVSVS*PC 
DliPASASQSAGIMGVSHHAQPPLHSYLR 

PLilLIKGDQFKYAFNLFSC 


2650 


16551 


A 


2667 


129 


301 


VRPLAGLiGSPLIFFFFFEKSIiAFVAQAG 
GQRHNLG*LLPLPPGLKGFSCLWLPSTW 
D 


2651 


16552 


A 


2668 


318 


21 


FSLLVKDFQNDIiAVNS SS I FPSTFLFLV 
RS*LDFVNLNFFNFYLFRDEFSLCCPGW 
TRIPRLKQSSRLRIiLSSWGYKCVPLHPA 
rNRFL*CDFSISGY 


2652 


16553 


A 


2669 


2 


322 


QDGLDLLTS*STRLGIjPKCWDYRRBPPR 
PAPINKCLLS I YHGPGPSLGTEGEKGQD 
YIPAI EQIiTDQWRRLTWTGCYCAAVAED 
SMGWWPLIjPGYEAEKEAGRFH 


2653 


16554 


A 


2670 


29 


261 


EFNTLSKCVWIY*LFSIiSLSLSliCVCVC 
VGVCVCVCLCVS VCLF YGS P I CLHFMHP 
DTLSEES FHFLQI ISKELLKC 


2654 


16555 


A 


2671 


5 


224 


GSAWWLTPVIPALWECKAGRSPEVGSLR 
PA* PRD P PALiAS ES AGITG VKKJNTL. r AEL 
WEVKSCILLIFISLiDKV 


2655 


16556


A 


2672 


126 


321 


WGRRGARARRDSHRALRVS PELALLELR 
P S TMAH I CNPSALGGRGGW I T *DQEFET 

NliANLEKPC 


2656 


16557 


A 


2673 


360 


74 


SRLLRKIjRQEVGLNLGGRGCSGPRLHHC 
TIjAWATERDPDSKNKTNQKNKQRR* *tl 
PETNSLLAWEPDCVCKTNKIiATRLEIMA 
QKSCSQRPQYH 


2657 


16558 


A 


2674 


192 


357 


RIRKWLYYYYYYLLRWSLALSPRLECRG 
T I LAQCNFHL* S S SRS PASAS P VAGIT 


2658 


16559 


A 


2675 


236 


375 


EEVLYSRKDRHVASYSRK*W*RSGAEAH 
ACNPSTIiGGRGGWITRSG 


2659 


16560 


A 


2676 


236 


2 


atkvs ins IGTLGVKVGGLLEGRRLRPG 
PKQSSCLGI,QSSWDHR*VQPHPG]srNFTL 
FVETRSMLARLVSNSWPQAI fq 


2660 


16561 


A 


2677 


164 


389 


LTVQHGLRGLRKLTVMVESEGQASHLLH 
KAAGRRSECQQGNCQMLIKS s yqpcaea 
HACNPSTVGG*GERITRSG 


2661 


16562 


A 


2678 


208 


1 


CSVYEGSCFCILRVRRVCVCVCVCVCVC 
VCARS CI YVRSENH* KQLPS SS I LT*RF 

*iqstfrsnkrlr 


2662 


16563 


A 


2679 


159 


382 


rwliknhptqariqvrglldrdgqtqtw 
lwesgvqplattprrhsemggwpgavah 
acnpstlggrgg* itpprpeskledclt 
gtarhrlgcgsqgcslwlpppggiqkwg 
vgraqwltpvipalwvaeagrslevr 


2663 


16564 


A 


2680 


29 


367 


dcqseqlrklrwehhwsqgv*gcsqp*s 
hhwtpawvteqdpvqnkqtknynthqra 
gevwrbinciisteqltelyseitqilaii 
s vrn* kiil * yvks s vi s aetq i sggki f 


2664 


16565 


A 


2681 


256 


1 


ENIVPVRFLCIHRLFAILIMVQ*IiSRKR 
DHIVIHLNLTLETVYLKKWQTRPNAVAH 
ACNPSTLGGQGRRITRSRARDYPGQHGE 
T 


2665 


16566 


A 


2682 


1 


395 


LLIEHILIAMALLILTERKILGCIQLRK 
GPNWGPYGLIiRPFTDAIKLFAREPLQP 
VTSTITLYMTAPTIiAVTMAVIiL* TPIjRI 
PNPLVNHDLGIiLCTLATSSLADY* ILi*S 
R*ASHSNYALIGALRACPH 


2666 


16567 


A 


2683 


3 


349 


VRIDNAAVCLLY*AWAEFIWALANLI IR 
PDLDHPGNLLGNDHI YNVI VTAHAFVI I 
FFIGIPIIYGGFGN*LVPLIIGAPDMAF 
ARlTKriRF*LLPPSLLLLLASArVEAGA 
AGT 


2667 


16568 


A 


2684 


99 


2 


LTVAHACNPSTLGGQSG* ITRSGVREQP 
GQHG 


2668 


16569 


A 


2685 


325 


4 


SSFPMSFIINSFIIIFLYTTIFLIFYDV 
VYSPEFKP*VCFCIiTECWDYRREFQCSF 
LPCKRV*VriAPFIKNFFPFFPFLRWNFl 
FVAQAGVQWRHLGSLQPPPPNFK 


2669 


16570 


A 


2686 


382 


269 


MGFHHVAQADLELLGTNDPPALASQSAG 
ITGVSHHAQPKLTF*TFSY*ACVGGSSP 
QFILWQHGLAGNS FCCF 


2670 


16571 


A 


2687 


3 


381 


GHLYALTFFFNSQICLFPRIYKTCKKSF 
WLPFLNLVSLCAIEGYKTKKVPSNV*KG 
IFIHHQTGFILRMQGWLNI*TSINGIYH 
I SR I EDKNHKTMF I DAEKWFHNI *HP FI 
lENIRPGWAHICN 


2671 


16572 


A 


2688 


289 


91 


IKYKELGVACSKTSRNVSYYCYHHHHHH 
RCHHYHHHS *RIiAAIjCEESGWRKALGST 
SMGRVAFHRH 


2672 


16573 


A 


2689 


3 


211 


LHHVGQDGIiDLLTL * S APLGLPKGWDYR 
TEPPFPAPNEFLLLHSLLSKHYAIiGRPH 
TPRERKKASGILR 


2673 


16574 


A 


2690 


2 


197 


DLLTS*SAHFGLPECWDYRREPPRPASY 
AFI KYFCPALS CFV^CIETGP YL 1 FAYLR 
EESFVRSRT 


2674 


16575 


A 


2691 


253 


356 


AVPVKMAI VKKTRITNRCW* GCGE IGTFL 
HCWWEC 


2675 


16576 


A 


2692 


211 


339 


PGHPLS I ILQWIWGEDTRRGPMHEALH* 
PCCSELRSCHCTPAWPTE*NSIF*KKKK 
KWLGELKEKQKNMACANDP * PGHPLS 1 1 
LQWIWGEDTRRGPMHEALHGKSGDGVYA 
VRLLTAY 


2676 


16577 


A 


2693 


1 


175 


RHEGLNLGGRGCGEPSSCHCTPAWVTE* 

DSVSKKKRKGKKKSALILIFSNGVLFCH 
LG 


2677 


16578 


A 


2694 


234 


13 


KTSREPWSEFLSIKGEQQCSFGLIjYVFC 
FQMESHYAAQAGV*WCNLG*LQPPPTGP 
KQFSCLSPLSPSSPASAS 


2678 


16579 


A 


2695 


149 


2 


SQGEDFTKTRALQWRSRRSWAHACNPS 
TriGGRGWWIT*GQEFKTSLAN 


2679 


16580 


A 


2696 


2 


194 


CIGLGWAGACSSRLRQBWDMNLGGGAC 
S ETRSHHCTP AWVT E * DS I S KKKRKRWR 
WGGTRFSN 


2680 


16581 


A 


2697 


338 


357 


YLIRRKKI SNSKS * FS WPGWAYACNP S 
TLGGRGRRTTRSAVRNQPGQHGDPKFFL 
VSS 


2681 


16582 


A 


2698 


53 


293 


TI S I KS KKWKNKQGE YNKWCMENWIiAI C 
RRLKLCP YVLVHS C I AI KNYLRLGNL * R 
KE V* LAHS S TGCTGWMAEKPHGRR 


2682 


16583 


A 


2699 


2 


335 


LYAAKVPLQKACIRTFSYISFFIFFFFF 
FFGKQSSFGPPGLKARGGTPLIGTPPLG 
VKRNFLPQFSGKPGITGCPPLPQKFWFF 
KKKRGLIF*PGGVPNSDPKGIGPPNPP 


2683 


16584 


A 


2700 


166 


3 


D YRRVPPRLVKR * GFT W*AGV* WRNHG 
SLQTQTIGLKQSSHLSLATCWDYRYE 


2684 


16585 


A 


2701 


209 


340 


YRMTVFPARHGGSHTCNPSTLGGRGGWM 
T* CHE lETTLGNMVKP 


2685 


16586 


A 


2702 


78 


341 


EVACNCIiLPAIGCFAS VTYV* S S VLL * F 
EL*VLNENSSF*NPIH*PF*NRVLICCP 
DWSVMA*S* FTEAL.TSQTIRSSHVSLPS 
SWDY 


2686 


16587 


A 


2703 


258 


185 


TMIAPVHSSLGDRARPYLFKQKR*GIiAL 
SPRLEWTGAI IVHRNLEFLCSSNPPTSA 
SGVTGITEMEVERHGWIQETFWK 


2687 


16588 


A 


2704 


361 


41 


NFGPVYKTNPGPSRPGWGPLGKPPGPQK 
FPPFGVPPSPPFPKKGSVFSPPGGQKGP 
FKGVFWGFP*PRGKKTPTQQPGPSPKVP 
NWDFPKGGFPKKKKKKRKILRN 


2688 


16589 


A 


2705 


12 


329 


SCTLQLTTPRLK* SSRPSLPNN*DYRHV 
PPHPASFP*FSEMF*DSTQIFLNSLKFS 
S * YYLVIFFFKMEPCSITQDGVQWCDLS 
SLQPLPPRFKRFS CLSLPSTRP 


2689 


16590 


A 


2706 


232 


1 


KRKRKSFKTYYKKFFLKSSFKNQCPYAI 
GHIFRSIIFFFFLRQSFGLVA*AGVQWQ 
DLGSPQPPPPGFKPFSELRLR 


2690 


16591 


A 


2707 


135 


1 


CFFLIFFFFSSDTQAGV*WPQPPGIKRS 
SQLSSPRSWDYRRISPR 


2691 


16592 


A 


2708 


1 


157 


LDLLTS * SARLHHPKCWD YRRE PPRPAD 
YKYFLKEVASLIVKLYLFCKLNPE 


2692 


16593 


A 


2709 


88 


344 


MQEHYYFIKRGNKMI IRKYVQLHANKIiD 
SLDKIDKFQVKNHQN*L*KK*KSWPGAV 
AHACNPSTLGGRGGWITKS *DRDHPWLT 
C 


2693 


16594 


A 


2710 


1 


383 


LHKVTT VLRFLS KFHILVLYF YYTCVHV 
SVSDMWGWFLY* IFTDMYNAVFQ*LDFF 
KLS I KFLRCI C* CVY* I FI PVFI YCFFL 
YLICLFLSYFLFNIYVMLLFYILCIIYF 
YVIFSSSSIF 


2694 


16595 


A 


2711 


227 


1 


IFFFSNSFLKISLGFFLNFAFF*KPKFF 
PFLSFKINQNFPFGFPFNFLKIFFKFFP 
NKFFKP*IYLTPFKKTKPK 


2695 


16596 


A 


2712 


1 


136 


KNTKISQGWWPTPVIPATQEPEAGGSLE 
RGRQRLQ*RNLGSIiQPLPPTFKRSSCLR 
LLSSWDYRRRPPPLANFCIF 


2696 


16597 


A 


2713 


3 


349 


LI PRGQGST WLP YNPATS I FGNDLNE I 
KMYGHAKTCIWMFMTSSFI IDRTRKRQ* 
CSSVREWINKQ* CIQTOELVFGRNFFFF 
FPFFFFPFFFPPPFGGPPPPKlQrPKEI 
NPQ 


2697 


16598 


A 


2714 


181 


342 


TSQLLNIiPLP 1 1 KA* AIRQEKE I KGI QL 
GKEEVKLFLFAHDTIVYP 


2698 


16599 


A 


2715 


3 


205 


PRHVAQAGLELLSASSLPTLASRSAGIT 
GVSHCVQP * TVS SLRLGF YDTS PKI AD I 
QS CFS WVL YYCLQS AEL 


2699 


16600 


A 


2716 


388 


23 


ASNQSKNSFEKGEKNQQSVMVKTSQQAL 
NKRAFFDMIRSVYQKTYK*RNA* *ERQS 
FSLKVENKKEYPSSLLLFNIVLKVLVNT 
GHKEK* KAHRFDRNI IQFAliTDMI VYVEN 
PKDSTKRLS 


2700 


16601 


A 


2717 


4 


194 


FSCLSLLSSWDYRSMPPHPANFLYF**R 
RGFTMIiARLVLNSWPHSAGITGVSHHAQ 
PFNYFFI 


2701 


16602 


A 


2718 


1 


212 


liCCPGWWETARLKRSSRLSLPKCWDYRH 
E SLHLiAFGFFLL * Y PGPYLMVYVFSPNL 














YDFSLVYSLYGTSI 


2702 


16603 


A 


2719 


53 


299 


KAVHWIFFITKVHLTMETQIVSLSLFP 
FPSFLHPSLSLSSFIi*RDRVLLCYPG*S 
AWQS *FTWPQIPGLR*ASCISLVSS 


2703 


16604 


A 


2720 


139 


320 


QPTMRHSFSF*KQEISIFLKVSSSLHHT 
LRFLFFFFLLDRVSLCHPGWSTVALS*L 
TVAW 


2704 


16605 


A 


2721 


1 


99 


IILiAYSSITHIG*IIAVLPYNPNrTILN 
LTIYM 


2705 


16606 


A 


2722 


215 


361 


LTFPFFFFERESNFVTQAEMQGGNLV*L 
KPRPPGIiKHFYYLSLQSTGNY 


2706 


16607 


A 


2723 


335 


24 


AIPLRPEF* THPGQNGKTRFPLKPQNLT 
GVGGNARYSQFPKRLRLKIPLNPEVGPS 
INPNFHPGI PFWAQKENFFPKKKKKKE I 
IMLNFSLrCYNLFFLYICY 


2707 


16608 


A 


2724 


27 


314 


WKQLKYPPSDEWINKMWCIHAMEYLAIK 
RNKVQLCVT^RWRWLMDTKKK 


2708 


16609 


A 


2726 


1 


112 


GFHHVSQDGIjNLLTL*STHLSLPKCWDY 
RHEPLCQAR 


2709 


16610 


A 


2727 


271 


3 


RSPMLKNQAIKSVIMVYIGTIWRSAGPF 
AIFFIAEYTNI III ITIiTTTIFLGTTYD 
ALSHELYTTYFVTKAVLLTSLFL* IRTA 
YPRFR 


2710 


16611 


A 


2728 


121 


1 


ENKYRPGAVAQAY I PMTLGG* gg^it*g 
REFETSLTNVEK 


2711 


16612 


A 


2729 


2 


290 


NRHFTKENIQiyiPTRHKKRDPQSSLVIRE 
MQTKTKVRYYFILTRMAKVKKIDWTTC* 
* G YE KLNS PKLQTTKCP I FE WVNKLWHS 
QIVEYFSGIQHA 


2712 


16613 


A 


2730 


98 


264 


IiVSSE***iyQILFFFPEMECHSVSQW 
VRGSNIiC*LHPLPPDFKRFSCFSYWEG 


2713 


16614 


A 


2731 


2 


286 


PFYKATVIRTVWYW*KN*QIDQWNRKES 
PE IDLHK * S LiL I FAKGAKAI Q WRKDSL F 
WKWC*NRWrSTCKNESMLGVVAHTCNPS 
TSGGQGGRIA 


2714 


16615 


A 


2732 


323 


181 


RDFVLLVE I TMLARLVSNS * PCDLPASA 
AQSAGITGVSHRARPFLDL 


2715 


16616 


A 


2733 


272 


131 


GRVDRIiNPGGGGCS*LGSCQCTPAWVKE 
TPSQKKKNKI ICFYAFLFI 


2716 


16617 


A 


2734 


227 


87 


AASTLTLTSLIPPILTTLVNPNKKNSYP 
HYVKSIVASTFIISLFPTTIFMCLDQEV 
IISN*H*ATiyiDFT 


2717 


16618 


A 


2736 


25 


415 


FSKYLLDHQLSAGLliVEQHRGSLCPLSIi 
LHSDSSLSFAVSGKVSIiAASIRNKLELP 
ET*RALMM*NHGHPLEFGGM**PWT*EP 
GS *RC * ELGQPGQARRIACNPSTLGGRG 
GWIMRSGVRDQPGQDVET 


2718 


16619 


A 


2737 


390 


1 


KGFFLETKRKFFFTNGFFPLLGGKGVPF 
PGKRLGPPILGFYPPQRVFPFPKPPFFP 
PG PFKRGQ KPPGGGKNF * RGP PGP P KKG 
GAPTGKKKKKEEERKEKEKKTPRNISKP 
LSHPFPNFPFCATVRGFS 


2719 


16620 


A 


2738 


96 


331 


QFTYTHGLFPFNFNRLOTFFFS*KLVGG 
V*LCCPGWL*TPGLKQSSCLSLLSTWDY 
KGVPPCLAEHFFFFEKKSPLSL 


2720 


16621 


A 


2739 


379 


14 


LREVFPFFSPPKIKCFSmSPQKYFPPP 
WFKTPPTPPFFFPPLFGKNFFFPSLFN 
FCPPRGFF*RPPLFFFFFFFFFFFFFFF 
FFFFFFFKVFYSLIiLLYFLPILK*MLTF 
KTSISTQCY 


2721 


16622 


A 


2740 


198 


38 


ANPLFPFFFFLRDGVLVYRPGWSAWQS 
*LTADSTPWAQVILQPQRNPIEHDF 


2722 


16623 


A 


2741 


290 


39 


MSKVGPIjIiGVLGTS FRVLLKGPEIKGDP 
LLANPIiFFLFFFLRDGVLVYRPGWSAW 
QS*LTADSTPWAQVIIiQPQRNPIEHDF 


2723 


16624 


A 


2742 


1 


399 


ILIRKHSDIGTLYLIiFDA*TGTrGTDLS 
VIxNRTELEPDQATLLGNDPVHEVIDTAH 
ALGI I PFI VI PHI GGFGN* LVPLI IGA 
PD^4AFARI1SINISF*LLPTSLLLLLASAI 
EEAVPGTG* TVYPSLA*NyCR 


2724 


16625 


A 


2743 


398 


2 


SPPPLXFFLWGFSYIFPPPKKFFFLINP 
PPKFSPPPFFFKPPPPFFFFFPP*EEKK 
NFCSPPPFSPPPPFPLLPPPLFFFPFFS 

KKKKKKKRARSRSRTSPTRP 


2725 


16626 


A 


2744 


295 


185 


FFFFFFFFFFFFFFFFFFFFFFFFFFLF 
FFLFXFFXF 


2726 


16627 


A 


2745 


374 


4 


LKKHKSTRVP*NVKSGK*NFSPPFKIRP 
WAQKRAKKKGAREKKAD*EKGEFGKL.KK 
KKS FRPGKKV* G* PKGFQRNFEKKP VKS 
QGPVNMGVIPAPEKKPRIPPVAKIPI *G 
AQKKKGRPLAI 


2727 


16628 


A 


2746 


2 


382 


QDATAP IIDEL I S FHDHAD II I CLI CFL 
ILDALFIiTLTAKLTNTNISDAQEIETV* 
TILPAIILDLIALPSLRILYITDEVKDA 
S LT I KS I GHQWY * T YEYTD YGGLVFNS Y 
ILPPLFLEPRDLGIiL 


2728 


16629 


A 


2747 


2 


374 


DWPPTVKRKTNPRAQSTAADRFILLFTV 
RGLTG I GIANS S LD I VLHDT Y Y WAHFH 
YVLSLGAVFAIIGGFIH*FPIjFSGYTIiD 
QT YAKI HFT I I F IGVHLTFFPQHFLGLS 
GMPRRYSDYPDA 


2729 


16630 


A 


2748 


55 


230 


QI PAK* INKYLQN* KKKKKKKKKKKKKK 
KKKGACLKKPHGGAKEKSWGRQKLFSFK 
GG 


2730 


16631 


A 


2749 


161 


3 


NGNYTLVKKCFNTKSKIKWVKW* IFFFF 
MRWSFTLVAQAGVQWCKLGSLQPL 


2731 


16632 


A 


2750 


362 


1 


PEKKS FAX KGVKKAGKQCKFCDAPKKGN 
PPKKKNPTGTPKRVLKK*PPREEKKNSP 
QRKKKMTAKNKEKEAPQEEI*KGPPRYI 
FFFISFFPFFFFYFPFFFFSIVINLFGT 
FIATTLEA 


2732 


16633 


A 


2751 


311 


110 


ITLPRHGGPHVQSQIiFKRLRWKDHMSIjG 
DRGCSELSSCHCTPVWRQSKTFSKIKYG 
RNGTDKAATCNPGRPRSWGRKVA*GQES 
ETSLSlSrSRTRL* KKTKIWSKRDRQGGYL 


2733 


16634 


A 


2752 


223 


16 


KKTPQKKKNTEGKGPVKKSPPQKPLKST 
PP^IFFYVFCFFFFFIiYFIYLLFRIFSF 
FFILFVYYLFILP 


2734 


16635 


A 


2753 


367 


10 


PAPRGGVYRGRQASLSCSGLHPLRASWP 
IiCSPTQA*AMAGAPPPASLPPCGL.ISDC 
CASNERGSVGVAIiS *SGAGDNLLVCRIiL 
SGKCRNHPSSASLTLAAVDWSCSYSAIIi 
APPLKNS 


2735 


16636 


A 


2754 


14 


341 


PGPNF*FGGQNSPGKNQNLSLPT*KGNG 
PGNPKNGGPTVPGTPPETRGSIiLRTLQG 

PLRPSGTPPTRRAPGTPVGNPGF*PRDF 

FAQKGPNVREGLKQGPNPKGLWPPP 


2736 


16637 


A 


2755 


15 


172 


HSWWECKLVQPL * KTVRRFIiKKLKIDPP 
YYPAI PLLGI YSEERKSVKQSRRG 


2737 


16638 


A 


2756 


263 


2 


QKPLPPRNDPVFPKGPERTPRPTWGFIP 
GGVPLGKFFTLLVSPFFFFFSPSPNLFF 
F*NRVSIiCRPGWSAVV*SWLTAASTSQA 
QAI 


2738 


16639 


A 


2757 


311 


106 


FVCFLRQRLALLPLECSCVILAHCNLRL 
IiCSSDSPG*QSETVSKKKKKKRKKEKET 
GNL*RSIiDMKGR 


2739 


16640 


A 


2758 


23 


328 


NITY YT* IjFVLYFYVPGHTWFFFXFXF 
FFFFFFFFFFFFFFFFFXFFFFFFXFFF 

fffxffffpfffffffffxxffffffff 

FFFFXFFFPFFFFFFFFF 


2740 


16641 


A 


2759 


1 


338 


dr*lfstnhgyigtlyllfga*agalgt 

ATjSLLMRAELGQPGNLLGLDHTYNGIDT 
AHAFAIirjFMSIiAIIIRSFGN*LVTLII 
GAP VLAAPRI TD I S W * L * PHS VLLLLA 


2741 


16642 


A 


2760 


120 


2 


INKYAWLGTVAHACNPSTLGGRWTA*GQ 
EFKTSIiANMVK 


2742 


16643 


A 


2761 


251 


3 


CRGQ I GT PTLHYS S S S S S KWI KDLNVRV 
KTIKLSENTGVNLRELELGDCFLDDTKS 
KNSTRKIRK*SLKIiETFFIiGGTR 


2743 


16644 


A 


2762 


11 


303 


AliVLKRSSCLDLLKCWDYRHEPLHLALS 
SILMYRFGL*IFDIFFYWYSREIDIiIFV 
Y*FFFFFLETGFWFIiLPRGDLG*WEPPP 
PGYKGSSGLGLPR 


2744 


16645 


A 


2763 


236 


356 


D WL I F VFS VERGFCHVAQVGLi* LLGSGD 
PPASASQGAGIT 


2745 


16646 


A 


2764 


2 


332 


LTQTPGLK*SACLCLPECWNYRCEPPHP 
AGFLF * RHPDKGVS * tiL I YPGTGWS 


2746 


16647 


A 


2765 


297 


19 


KKKSWPFFKKKKCWGGATKPK!!5r*RGFFL 
KFVFYFFPIG*GVFFFFFFLISIIIPFP 
FFFFFFFFFFEVFFYFFFFYFFFFFFLF 
FAMKINVFY 


2747 


16648 


A 


2766 


1 


147 


QILRRLRWENCVNLGGGGCSEPRIjHHCT 
PAWVTV*NSVSKKKK]SrPLIiPF 


2748 


16649 


A 


2767 


357 


183 


NWDYRPLSPRPGKFLDF*VWRGFPLLTK 
liVSNF* PCNLPPSTSPKAGISGPNPRAQ 
P 


2749 


16650 


A 


2768 


2 


193 


RWESCLNPGGGGYSEPRSCHCTPSWVTE 
*DSVSTTTTKEKERKQ3SrRTWSESNSLQK 
YKDTNHF 


2750 


16651 


A 


2769 


333 


173 


VSQDGLDLLTS* STHLGLPKCWNYRREP 

PRPAQKLLMVIWLGFMSSPKSPLEL 


2751 


16652 


A 


2770 


3 


331 


CMENCMVIPPKKVKHKNYHMIQKFYFWK 
AVQQLSTELNIL*PYDPAITLLGIYPKE 
LKTSFRTKTCT*MLTASLFVIAKTWKQP 
RCPS VGE * L SNL * YVQTME C YS VLK 


2752 


16653 


A 


2771 


209 


55 


RPGRPQVLRRVRPQNRLNPGGGGFSEPK 
*GPCPPAWGAQRDFISKICKKTIT 


2753 


16654 


A 


2772 


213 


57 


RPGRPQLLRRLRLQNRLNPGGGGCSEPR 
*GPCPPAWTQRDFISKKKKTIT 


2754 


16655 


A 


2773 


148 


1 


KATGRKTWVKSRCWVGTVAHTCNPSTLG 
GRGR* TARAQEFKNSLRTLAK 


2755 


16656 


A 


2774 


2 


364 


WSAVRRDWITALQPERQRETPSQKKKKK 
SEQ ID NO: of nucleotide sequence


SEQ ID NO: of peptide sequence


Method


SEQ ID NO: in USSN 09/515,126


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence


Amino acid sequence (A=Alanine,
C=Cysteine, D=Aspartic Acid,
E=Glutamic Acid, F=Phenylalanine,
G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine,
N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop codon, /=possible
nucleotide deletion, \=possible
nucleotide insertion)














KKKKKKKKKKKGGVRHGVKSRKP*TSGR 
PGRKNHWRQIQGQPKTPRERAEALKNRV 
GLKKLFKTPGHVNGAGNPKFRNGKSGKP 
PEVHLNGAW 


2756 


16657 


A 


2775 


121 


2 


HLRSGVSDQPGRHGKAPSLLKIEKLAGS 
GGTCL*SQLLG 


2757 


16658 


A 


2776 


195 


3 


GRVDINTLLALLLIIITF*LPQLIGYIE
KSTPYECGFDPISPARVPFSIKFFLAAI
PFLLFDL


2758 


16659 


A 


2777 


3 


285 


FUSriRNICVSLCCPS *A*TPGLKQSCCLG 
LPKC*DyR 


2759 


16660 


A 


2778 


135 


379 


HWSATVKIiYAILLGLLKWESGTLLCSKD 
VLFFL*G*SSAYSIESVIMQIKAT*VKG 

KARVQLGAKKLKAYWQRKSPGIPAG 


2760 


16661 


A 


2779 


364 


1 


GSFTGAVILIIAHGLTSSLLFCLANSNY
ERTHSRIIILSQGLPTLLPLIAF**LIA
SLANLPLPPTINLLGELSVLVTTFS*SN
ITLLLTGLNILVTALYSLYIFTTTQWGS
LTHHINNIK 


2761 


16662 


A 


2780 


356 


1 


GCLRAHIWPQKGNHEGQVHLFIDKVCRQ 
PMTEDCINSITTQVAQIFLVHFLLRQSL 
TLS E iCRKKRKKKRKKRWR I E S PE INTH I 
YCQL I FNKGAKKI K* G *NSIiFNKTYLDP 
WISTCK 


2762 


16663 


A 


2781 


1 


204 


AQVGLQDAS S PR I EEBITFHDHALiI 1 1 F 
IjICFLVLYALFLTLTT*LTNTNISDAQD 
SETDCYSQSYIL 


2763 


16664 


A 


2782 


1 


522 


YKCNECGKVFNQQAHLAQHQRVHTGEKP
YKCNEGGKTFSQMSNLVYHHRLHSGEKP
*KCNECGKTFSQMSKLVYHHRLHSGEKP
*KCNECGNTFHHNSTLVSHKAIHTGEKL
YKCNECGKVFNQKTTLARHHRIHTAEKL
YK*EECDKVFGCKSNLETHKKMQFSKTD
SAFSLQ


2764 


16665 


A 


2783 


1 


327 


ENRLNSGGGGCSEPRSHHCTPAWATERD 
S VSE KEKEQKQNFHLNAQSNCQKLKS P * 
KYLKHPEKTDILSKASQYNNQLTADQSI 
LKFKTKTFDQKKGRPKGHGMTYSNS 


2765 


16666 


A 


2784 


335 


128 


FSLIFCRAGILPCCPGWS*TPVLKQSSC 
SSLPKCSDYKQEPPYLACATLKGYQIPN 
FYCWPHIFKRMFY 


2766 


16667 


A 


2785 


208 


3 


RPICPLSLWGVSFLFFFFFLRDRVLICH 
TGYSAVAQYC3NTAHCYSPRLK* SSCLSL 
PSSWNHNLIPPR 


2767 


16668 


A 


2786 


49 


332 


VEMGNSPINRKYVYPKSYNRCLKCNTEE 
GVUSTDLGIAEFNTCSKSLIjIiIilFFKGRI 
LLCCPGWrEWQSQLTASSTLGLK*TFH 
LSLFGSWDHR 


2768 


16669 


A 


2787 


331 


187 


GCSEPRSPPCNAAWATKGDSVSKEKKKKT 
KKREEKKH*HKTSGEKINLI 


2769 


16670 


A 


2788 


325 


34 


RSQLRGLPSMSIS* I*NSRLR*LRPRRL 
FIFCRDRVSLCCSGWSQTPGFKQSTHRS 
LPKCWD YRREPS PLAQIRYLGLFFSQGL 
SS AFC YVAMVSGF 


2770 


16671 


A 


2789 


3 


150 


AASTRQLI FHFTSKHHFGFEAAA* YWHF 
VDW* LFIiYVS I Y WLG 


2771 


16672 


A 


2790 


86 


311 


NIHPLND I FTRLKNGFKKKE I SLVKNE P 
NKGT*SLISIKVIiFHT*K*FIRPGVAAH 
T*NPSTLGGQGGWIT*SQE 


2772 


16673 


A 


2791 


2 


123 


GGGACSEPRSHLCTPAWETE*DSISEKK 

EEKK*LQIYTNI 


2773 


16674 


A 


2792 


302 


162 


PLLRRLRPENCFNLLGGGFN* PRSRNCP 
PAWATKRDSLLKKNLKNF 


2774 


16675 


A 


2793 


1 


258 


GGCSGLRLCHCTPAWETD*DPVSKKKKK 
KNFTHAKLiIilFGTIjVLGKVKRGGPFKPR 
ILKLPWETWQKLVFTKKKKIFFLiiCNRGG 
G£ 


2775 


16676 


A 


2794 


199 


3 


VGPIFRHDKPPHSKQLCTYSPAFTQLFS 
SGKILKWIHLWPGTVAHACaSTPS TLGG * A 
RWITRSGDR 


2776 


16677 


A 


2795 


1 


302 


GGCSEPRSCHCIPAWATEPDS I *KINK* 
INK*K* *KINRNKKGKNHFLSSST* *PQ 
VASSFLTGQHRYTKL* SSQKVLLDSATL 
QSKSNAEVKRVNRTTD 


2777


16678 


A 


2796 


228 


326 


NDNGQSGWAHACNPSILGGQGGWI I *G 
REFKT 


2778 


16679 


A 


2797 


270 


1 


KFGISAPFAPSPKV*KRGFF*NFLGNGP 
PVKIPPF*TPL*NFSKPGKNFPPFF*KT 
RFSKIFPNGFFFFFFFFEGVLLCCPGWS 
ALTRSW 


2779


16680


A


2798 


307 


36 


FKNFCCG*EVSLCCPGWSQAPGVK*-PSC 
LGFPKCWDYRCELPCLASCSL* CDRNQT 
RISAWLWIEFHLSPILPVLPLFLKKPQ 
AGLLCF 


2780 


16681 


A 


2799 


132 


5 


I FEVTV* CTK*HKRNMQPGWGHACNPS 
TLGGQGGWIMRSGV 


2781 


16682 


A 


2800 


32 


314 


KQHPGNGPNPSGKGPGRASRFLKKKNFF 
F*KTPPKKPHTPCKPSPKGKGPERLPWH 
FFNQKFGPLPMGPNPDQRAGFCLRDP*G 
GGKNRSPPTL 


2782 


16683 


A 


2801 


139 


3 


AASTFYIFFETTLIPTLAIITR*GNQPE 
RLNAGTYFLPYTLVGEG 


2783 


16684 


A 


2802 


238 


3 


AASTSHVISSMYNIYIIIQFKTFPVFFF 
RDKVSLCHTGWSAVAQS*LTAASAAIiTS 
CVQGILPTQPPEYHTRPLRLFL 


2784 


16685 


A 


2803 


110 


291 


KKLGVFGFGFGFKTKSNFIMQAKGQLPN 
LG*LKPLPRGFPQFSGLSLSGTWDYKHT 
PATR 


2785 


16686 


A 


2804 


97 


2 


SYNNQDSWnEKEKTNRSVKQN*APIPE 
KKKKKNKVGGLTLPNCKTYYKATI IKTV 
WYWRKKRQ IGQ*NRI ESPE IDPHKY 


2786 


16687 


A 


2805 


268 


1 


ATFLLFYYVFRPQIPYYL* IFILSIiF*Y 
RHSGPPYVGPPQQYPVQPPGPGPFYPGP 
GPGDPPCRIiPIRKWWLVWLMPQLNPTVW 
PTIIT 


2787 


16688


A 


2806 


153 


3 


IPIjPKGLLVPLFGVFIiKVFFFFFFFF*D 
RVSPCRPDWSAVAQSRLTASS 


2788


16689


A


2807 


3 


216 


NAARDHI VKPSP * PLTGALS ALVMTSGL 
AM*CHSHSISLLILSLLTSTLTIYQ*RR 
NVTR * TS YQGHHT PTDQKGLRYG 1 1 LF I 
TSEAFFLS*FF*AFYHSSLSPTPQLRGH 
WPPTGLS PLNPLEVPLLNTSELLAS * VM 
PLSLHIASNTKPTNQHTNHIPMKAQCaSTT 
INIilPRPPHTN 


2789 


16690 


A 


2808 


168 


354 


FLERESGFVAQAEGQGQNLS*WKPFFPI 
LKHFSCLSLLRSWNYGPTSPVPAKVGGF 
WRNKIY 


2790 


16691 


A 


2809 


61 


360 


YVSNSKCSNHRK*SIjFFPFFFERESSFV 
PQVELHGRDLG* LQLWLPGFGKFPGLTP 

liRNGDDGPRPQPPANLGLLVKTGFSPVA 
HLGAHMLGTLGD CPALP 


2791 


16692 


A 


2810 


236 


3 


KMFFQiyiRYTMLKK*QATFGFIIiALKNVL 
KTFLAPIFFFLRRHSALVAQAGVQWRDL 
GPLPPIiPSGLKRFSCLSLPSC 


2792 


16693 


A 


2811 


140 


210 


NAKITKC*KGYGKRGTLMHCCWQFKMGQ 
LLWKTGYQFLKKLKIiELLSNVPIPYL* I 
WKKGNPYALLLAI 


2793 


16694 


A 


2812 


230 


1 


VKLCPVSLKTGVRPLLTSSEVIFILNCK 
FNIGV* LLPGSLLIiLLASAI VEAGAGSG 
*TVYPPIiAGNYSH2\EPYALV 


2794 


16695 


A 


2813 


3 


345 


HEVRIDVHTRTCFTFGTI I lAI PTGVKV 
FS *LATIiHGSNMK* SAAVL*ALRPrFLP 
TVGGLTGIVLSNSSLDIVLHDTYYWAH 
FHYVLS 1 6TVFAI IGGFIH* FPLFSGYT 
LD 


2795 


16696 


A 


2814 


2 


184 


ARVGFHHI DQAGLKLLTL* SAYFGFPKC 
WD YRHE P PRLALKFNNI KKKKKKKKKTP 
GEKI 


2796 


16697 


A 


2815 


276 


3 


ARVHRIDHGLM*HQPLGLK* SLCLSLPS 
SWD**SMQPCPDDYGTRSDQKKPTFKSP 
YVSQTGLEHLGSSDLPASASQRAKITGM 
SHHALV 


2797 


16698 


A 


2816 


3 


94 


HENQYSSniIIMAIAIKLGIAPFHF*VP 
EV 


2798 


16699


A 


2817 


93 


1 


GGGGCSEPRLHHCTPVWVTE ■^NI'VSKKK 
TRA 


2799 


16700 


A 


2818 


1 


197 


GTRAQSLLLGRLRQKNQLNLGGRGCSEP 
KS GHLTPAWAT * PDLVSTKS CTLI YLVT 

QALHITLLP 


2800 


16701 


A 


2819 


109 


2 


GVF * GVFIiAQGLTIiVAQAEVQWHDLGSL 
QPCPPRLKRLV 


2801 


16702 


A 


2820 


2 


347 


ARAPGFAENEVFVFSSCFVWMVPGGKGE 
NAELMQPSSY*ESQHPPPAPPRCSSCV* 
LCSLGPSLLGTIil FCQFTLSELPRFRKG 

*FSSSLKKVFCFGDRVLLCHPGWSAWQ 
TQL 


2802 


16703 


A 


2821 


1 


353 


CTRRDVTRESTYGGHHRPPVQKGLPYGI 
ILFITSQVPFFT*FF*TFYHSTLTPTPQ 
Ii*GHWPPTGITPLNPLKVPLLNTSVLLP 
SRVSIT^AHHILI^NNRNQIISSLLITI 
LLGLY 


2803 


16704 


A 


2822 


261 


400 


VEHSNSMKENFLGQGTGCHACNLNTIiGG 
RGGRITWRSGV*DQLDQH 


2804 


16705 


A 


2823 


273 


97 


LNTPRMQRLTQLATVILRFHKDQGFAML 
PRQVSNS*AQAICHLGLPKC*DYRREPP 

GQK 


2805 


16706 


A 


2824 


159 


355 


KKFFSIFFFFFFFF*DRISLCHLGWSAV 
VQSWLTAAST 


2806 


16707 


A 


2825 


67 


338 


LSPEL*LY*IF*KPFS*QLSLTSHSPSL 
FSALLLLSISPPTWLHSQSDKNLHLFSF 
LFRDSVIjIjCCLGYSAVT*SWFTVASNSW 
AQWLS 


2807 


16708 


A 


2826 


3 


359 


HEQKYSFLHDSQTLFGF*DS IPTPSNMD 
ETQQKSRIiELVRISLLLIEPWLEPERLL 
RSMVANNLVYDTSDSDDYHIiLKDLQEGI 
QTLMGRLEYGSRRTGQMLKQTYSKFDTN 
SHlvTHDAL 


2808 


16709 


A 


2827 


3 


338 


LERNLDTTFLDPAGGGDPILYQHLF*FF 
GHPEAYILILPRFGIVSHIVSYYFGRIiE 
PSGYLRMG*PIISVGLLGFIV*SHHIFS 
VGVDIHARAYSYFGSIiVMAIPCTIjEVFT 


2809 


16710 


A 


2828 


1 


406 


RHEGEKDRRPTFGPRHRGAGTAKMSASL 
VRATVRAVSKRKLQPTRASlIiTLTPSAVN 
KIKQIiLKDKPEHVGVKVGVRTRGCNGLS 
YTLEYTKTKGDSDEEVIQDGVRVFIEKK 
AQIiTLLGTEMDYVED * LS SE FVF 


2810 


16711 


A 


2829 


332 


3 


GIIVMQATIATAIiIGYVLPIjGQISL*GA 
TVIAYIIAAI PF IGTDLVQ* I * VGCS VD 
SPTVTGFIjAFPFVLAFIIAAVAALIiLRF 
LQGTGSPNPLGGASQSDKIAFQFSC 


2811 


16712 


A 


2830 


186 


349 


YSEREVAFFFFGNGFCF*TQAEWNGGNL 
G*LFLLPPR*KEFSGKIFPRTWNYSP 


2812 


16713 


A 


2831 


303 


590 


NILTTLLNKQSKSNQLQNKIIHTVYIKI 
KIFFFFLRQSLSVTQADVQWHCLCSLQP 
PPPGFK*FSCLCLL,SSWDYKHAPLHPAN 
FCIFSRDGISPS 


2813 


16714 


A 


2832 


1 


370 


EEIiITFRDHAVI ISFLICFLVLYALFLT 
LTTKLTNTNILDAQDIQTV*TILPAIIL 
VL IGLPFLR ILYI TDE VDDPSIiTIK*NG 
HQWD*TY*YTDYGGLIFNSYILRPLIjIjE 
PGELRLRDVDN 


2814 


16715 


A 


2833 


207 


1 


QFF I FLRHS FTLFAQAGMQWRDLGSLQP 
SPPGFK* FS YLSTLRGIARKITLAQEFQ 
TSLGNMGGPHPRA 


2815 


16716 


A 


2834 


223 


1 


DNLxAHKGKTRVYLKSRNKLGKGGGAGNL 
VSLDS IGSRRDHRRAPPGPANFVFLV*M 
GFPHVGQDGPELLTSCSC 


2816 


16717 


A 


2835 


1 


321 


GTRKPSP*PVTGALSALLMTSGLTM*LH
FHSITLLILGLLTNTLTIYQ*WRDVARE
STYQGHHTPPVQKGLRYGIILFITSEGG
LFAGFF*AFYHSSLDRTPQLGGQ


2817 


16718 


A 


2836 


57 


370 


IWMGRVLWKDRLYGVFCRAFNRAVTSRV 
WAEDPWRVPKTLSVDPRKLPPFS*ISVR 
GQI YFT* FFFFFETESHTVAQAGVQWWG 
DIjG*LQPSPPGFKRPCLSLP 


2818 


16719 


A 


2837 


2 


345 


ARAHRQL.DEP*L*RRPGERHPSW*SEET 
VERQRTKT*SESSQ'37GTS ITSSRNARRR 
ESEKSLHLETLNKEEDCHSPTFKPSTPD 
HPLKVMPAPSPKENAWVKRSSNPPARSQ 
SS 


2819 


16720 


A 


2838 


15 


376 


AKIEPLYSSLGNKSETPSQKGKKKKKKK 
KGKGGGEKKKKKKTEGGTRGLTQETQFF 
GNPKGPKHLGARN* KHRAKQAKRGNQPR 
SYRFTEEKKGEKI.FNPGGGRSQKPKTGQ 
WNSAGGKK 


2820 


16721 


A 


2839 


3 


336 


HELIASIIiLIY* *CRDVTRERIIPIAHH 

TPPVQKGLRYGI ILFITS * VCFIAGLF* 
YLYHSSLPSTAQLRGHWPPTGITPLNPL 
BVPLLITSVLLASLCSFI*AHHSLIEH 


2821 


16722 


A 


2840 


338 


3 


KI IFYIiFFLRS I AFLAQIEGHWAVFS S F 
KliWPPGFKHFSASTFLVNRVFKGGPRAR 
VNFGFFIKSGFFPIF*VGF*IiFFFFFFE 
TESRSVAQAGVQWHDLCSIiQAPPPSSC 


2822 


16723 


A 


2841 


2 


149 


ARGCSEPTLHPCTPAWVTE*DSISKKKK 
KKVECSNGNVCRTMCILCCFW 


2823 


16724 


A 


2842 


348 


175 


SACKWSITGMHCHA*VIVSVLLV*TGS 
LHVGQVGLELRTSDDMHTLASVSGGITG 
MS 


2824 


16725 


A 


2843 


338 


118 


DPGGGACSEPRLRHCTPAWVTKKDFVSK 
KKKK*NIKC*HAKSGTVLYSFIiYIiFYS* 
YFLMRWSVHIPGPSTK 


2825 


16726 


A 


2844 


189 


47 


KCVALIiLSLFLS CS VM* RNAC FRFAFCH 
DCKPLEASPAMLPIQSVEL 


2826 


16727 


A 


2845


335 


139 


EVSPSWPG*SQTPDLKRSACLSLSKCWD 
YRPGVSHRARLNGSFLTQLYEMLTYFPI 
IWVTIiQVFR 


2827


16728 


A 


2846 


3 


352 


HEQRLTPEWKKAATALGDWKVGAVDAD 
KHHSLGGQYGVQGPPTIKIFGSNKNRP* 
DYQGGITGEAIVDAALSALRQLVKDRLG 
GRSGGYRSGLQGTSYRSSKKDVIELTND 
SFDN 


2828 


16729 


A 


2847 


2 


269 


ARGLFSTISfHKDIGTLYGLFGA*AGVLGT 
ALS LL IRAELGQPGHLLGRDD I SRIHVK 
RI IPGCGTIiDGISVGRECACLWTPACSR 
KCARG 


2829 


16730 


A 


2848 


317 


28 


SFIiPN* KLAGHGGRPL * FQVLRRVRLEG 
CFSLEG*GCSEPWSCPCPIiTWATETDPF 
SREKKKRRRREEKAEKRRENRRVPSLSV 
FLEEYVELTLIL 


2830 


16731 


A 


2849 


326 


107 


FHHVGQDGLDLLPS*SAHLGFPKCWD*R 
CKPPCPASRNFKTLiKLYHIFSNNLIE*N 
*KS I TRGTLETVQtiPGN 


2831 


16732 


A 


2850 


224 


3 


SILQAWLKVPSPPGNPLGLLQSQVDFFF 
*RQ6IiAriSPGLiDCSGMIMAHCSLKLIiGS 
TDPPTIiAS*VAGTPSSC . 


2832 


16733 


A 


2851 


27 


329 


VRTAAINRPGFLLPCFIGQNFFPY*RFE 
TYRLGV'»^MILTPH*YLLSSLSGLDCVIP 
FSFILFFYWFIVYCHVFMVTLFPFLFVI 
LFVLCVFWCFYFLLFV 


2833 


16734 


A 


2852 


351 


60 


NliliSSLQGGRLPRPPGLSPHPQIFTPHG 
GVPL*IiPGPWKSRPEASLNPRS*PPPGQ 
PRGNPPPPKKNFFFFETVLLCRPGWSAV 
AQFQKKKNLLIC 


2834 


16735 


A 


2853 


186 


365 


LTQLLNIiI*YQNQIEYPQPYAQCGPRSG 
NAQQIGHSFPNG*QVPASGMHGLiPWTHQ 
GLKK 


2835 


16736 


A 


2854 


17 


239 


HFTTLQISMSLLTETLARHDSTHLWSQL 
L*RLRR]SrDLLKSGAGGCSELGLHHCTPT 

WTTEQDPVSRPGAVASEH 


2836 


16737 


A 


2855 


325 


140 


GFHYVSQDGLDLLTS*SACLiAFPKCWDY 
RHKPPCLAAYIFIQSRVFIiAVISISTHS 
SFQLL 


2837 


16738 


A 


2856 


1 


157 


GTRGFHHVGQAGLEVLTSDNLPVSASQS 
AGITGESH*AHERSDGYANTNSPG 


2838 


16739 


A 


2857 


157 


361 


LLLRIVSLFWRKSSPLGVQAEMYRPDLP 
VTKPPPPD*DKDSCi:iSLIjITWDYRLAPP 
LPANIjEF*I*TG 


2839 


16740 


A 


2858 


3 


346 


HEGraKFKI INVHLR*LNQNL* SGLEGD 
KVIWRNI *TS VTFIiFC*MNGKMNPDIiFR 
QFTYEVGRIFFFSLKD*VWLCCLGSSTV 
DLiGSLQPPPLGFKQFSCLSLPSSWEPPC 
PA 


2840 


16741 


A 


2859 


18 


233 


BCNPTILCCKGGGAQA*KPRLLGRLRQEK 
GLTPGGEGCRDLKSPLGIPPWATRPKLL 
LKKKKKKGGKKKTLF 


2841 


16742 


A 


2860 


356 


3 


RVNFKI FGKKGFCPGGPGGFE VLTPGDL 
GPLFFQKGGDS PS KPLKPFFFCFALKS T 
TLNYFQFAFCVLRPTPN*SVTQAGMQ*H 
DHSSLQPQPPSIiK*SSQLGFPSSWVYRH 
APPSC 


2842 


16743 


A 


2861 


115 


374 


I VCGKIIiNV* YTQ * HNKLQNKNKVD* PT 
IKLMLISQSLGILIFIQKRDFEVNYGMI 
LKKRGWPGPVAYVCNSKTLGGQGGQIT 


2843 


16744 


A 


2862 


163 


360 


LKRIWLLILiHSHPHTSPHDRGVTQIjEAI 
GPRWY*TYEYTDYGGLIFNSYILPPLFL 
EPGDLRLLDA 


2844 


16745 


A 


2863 


1 


236 


GTSPFFSFFFLSFFPSFLPFLSFFNPTT 
LLLSLHSNIS*KKVLFSFEMESHSVNRL
ECSGTISAHCSPAWATGTDSIS


2845 


16746 


A 


2864 


67 


364 


VCVCVCVCVSVSVCVCVCVYICTYICMY 
VCA*YRIRRYIMLPTLVYNSCPEIHDSK 
S CALHTGAITRATR * P GLC IHI P VCLT V 
CLTVWLHI CLS VCMD 


2846 


16747 


A 


2865 


2 


155 


ARGLTQENRLNLGGGGCSELRSHHCIPA 
WATE*DALSQKRKKKKGTYRGI I 


2847 


16748 


A 


2866 


344 


81 


IQPNSFIFLKIFISRDGALLCCPGWSPN 
SWAQQSSRLNLPKCWDYRHEPPHIiWQF 
L*KHYMYKWHKVHCVFSHYNKIFTGQTK 
TEN 


2848 


16749 


A 


2867 


217 


340 


RVFSDFYMNLKWSIiRKRRTWPGLVAHAC 
NPSTLAG*RGQIT 


2849 


16750 


A 


2868 


3 


349 


HEATSPIIQELITFHDHAEjIIIILICYIj 
LLYALYLTLTTKLTITIMSDAHEIHTA* 
TXLPAI XLDLIDLPCLRILYVTDEVNDP 

YLT I TS I GHQW Y * T YEDTD YGGL I FNS Y 
ILT 


2850 


16751 


A 


2869 


1 


352 


GTRIDVYTRAYFTSATIGIAIPTGVKVF 
S * LATLHGSNMK* SAAAL*ALGF I FIiFT 
VSGLTGIVTiADTSSIiDIVLHDTYYWAHF 
HYVLS IGAVFAI IRGFIH* FPLFSGYTL 
DQTYA 


2851 


16752 


A 


2870 


2 


342 


ARARFRTSIiLLAFALL*LPWIiQEAGAAQ 
TVPLTTLFDHTMLQDHRAHQLAIDTYQE 
FEETYI PKDQKDSFLHDSQTSFCYSDS I 
PTPSNMEETQQKSNLELLRI fllliesr 

li 


2852 


16753 


A 


2871 


254 


2 


ypgyqaisqsdmqsyayrhl*lqppgvk 
* ss i vs lls s *dhrp vppclane kffff 

*RQGIiAMLQSRLVLNSWLLLYSHAEPRA 


2853 


16754 


A 


2872 


282 


3 


hthilhyiyihhghist^hlspptdtiys 
lcnikiliytpydh*iiriplhqehavh 
rrrlihihihtsvcvcvcvsvcvacvcv 
cvcvwvlv 


2854 


16755 


A 


2873 


278 


3 


LWPLKNSGPQPWPQMDLGKPQVKTPWY 
YKKGP * KKFFRKF* KWPGFFFFFFLRQS 
HSIAEAGVQWRDLGSLQPLPPGFKRFSC 
LNQISSC 


2855 


16756 


A 


2874 


2 


348 


ARA*SLILVSLIIPNATSlSrLLGLLPYSF 
TPTTQLS INIjAMDI PL*AGTiy[VIGYSSK 
IRHALAHFLPRGTPTHLIP ILAMIESMS 
LLIQP lAMAVRVTDNITAGHIiTMPLTGS 
PTV 


2856 


16757 


A 


2875 


1 


260 


GTRENHIiDPAGRGCSKSRSHHCIPAWVT 
D*DSLKKKKKKKPQ I ILGNGGTEGNQK* 
RGNPFRGPWGPSRQKrKELGPGGKHGKS 
KF 


2857 


16758 


A 


2876 


176 


1 


EHLFCKKLLGEEKGSriSKEFGKCWVNPR 
*KWNSRPGWAHTCNPSTLGGRGGQITR 
SG 


2858 


16759 


A 


2877 


168 


1 


GVRVF6PS S VPE FS FDTCAGAQWRNLGS 
LQPLPPGFK*FSCLSL»LSSWDYKRTPRA 


2859 


16760 


A 


2878 


67 


333 


WQGLGRAVARVPRSYEITWAGNLKGAA* 
FQQLFFFFFFHLLFFFFFFFFFFFFFFF 
FFFFFFFPFPLFSFFFLFFPPPFPFFFF 
FFFFL 


2860 


16761 


A 


2879 


335 


1 


GRKEGRREKEKERKKKKERKKERKKTI F 
KKHI PS PLAF PRKAWKLKE I Q I KLVAH * 
KINRQAIRGCPPNKRPHPFQNMIKPGRE 
EGEGGIiETSRDGERNLiWIFGPSVGPLR 


2861 


16762 


A 


2880 


3 


345 


HERHETASIILLIAILFNNILSGQ*TIT 
NTTNQYSSLIIIMAIAIKLGIAPFHF*V
PEVAQGTPLTSGLLLLT*QKLAPISIIY
QISPSLNVSLLLTLSILSIIAGS*GGLN 
QT 


2862 


16763 


A 


2881 


347 


156 


WLIFVLIiVEMGFFHSGQAVLKLLASSDP 
ASiyiTSQSSGITGIjRHCAQPR*AFLIALC 

LPPSAKI 


2863 


16764 


A 


2882 


338 


2 


IKKAL*HE*LREKKKSPIRTSQSSKAP* 
NLPRSKKKKESSNNYQLL*AHKLQKVKE 
MGKLLETPNLPLLS QKVAE PLNQP r TS S 
KIETLIKKPYHPEKSYGHDEVTAKFSRA 


2864 


16765 


A 


2883 


206 


1 


IRTRPFTPMFIFYMFAAYLATRSCPFTLA 
GVQWHNHGSLQIi* PPGLKESSCLSUIjSS 
WDYYACLHAELV 


2865 


16766 


A 


2884 


341 


140 


DGGCREPRSRHCSPAWVTE*DSKQNKTK 
QKQKRYTNI S SPQTS PLCYS FS AYTPLH 
NMPGNIMFSSL 


2866 


16767 


A 


2885 


360 


247 


FHRVSQDGLDLLTS * SAHLSLPKCWDYR 
REPPRRPKAL 


2867 


16768 


A 


2886 


229 


351 


KUSINRPGWAHAFNPSTL* KLNNRPGW 
AHAFNPSTLGG*GGWIMRSGVRDQPNQH 
GETP 


2868 


16769 


A 


2887 


359 


2 


FYSSSSSSSELVPCRQDVQVPHYLEGLF 
LRSCFMEIKYDT*KRKIKYTHVHTHTHT 
HTHTHTQREKERDFPRFTAKWKANLEAG 
SGYATASS IRAVLWTTITLV 


2869 


16770 


A 


2888 


4 


305 


ADSHMWKYKAPGI TKILLKKSKI * GYHQ 
DWI S SRYQDLFHKDMI FKIMS YWYKKRQ 
VDHWDRIESPETSSQMYEHLIYMKDNIA 
EW*EKDSFIiSKWCWDN 


2870 


16771 


A 


2889 


1 


436 


IAILT*YDYTLL*RVGPRGQEFGTRATV 
ITNLLSAIPYIGADWK*G*GGYSLDSP 
TLiTRFFTFHFILPFIIASLiATLHLLFLH 
*TLSNIPLRITSHSHQITFHPYYTDQYS 
LRLIiP FLLSFTTLLL I LPNFLFYPP * TT 
LTFPF 


2871 


16772 


A 


2890 


1 


218 


RKISPLrKLINHSFXDLPTPSNISA* *N 
FGSLLGACLILQITTGLFLAMHYSPDAS 
TAFSS lAHITRDVNYG* I IRYLHANGAS 
IFFICLFLHIGRGIiYYGSFLYSET*NIG 
I ILLLAT I ATAF IG YVLP * GSLLGACL I 
LQITTGLFLAMHYSPDASTAFSSIAHIT 
RDVNYG 


2872 


16773 


A 


2891 


3 


368 


LSNS * ANNLI *LiAYTIAFI VKI PLYGLH 
Ii*LPKAHVEAPIAGSIVLAAVLLKLGGY 
GIIRLTLILNPIiTKHIAYPFLVLSL*GI 
I ITSS I CLRQTDLKSLiIAYSS I SHIALV 
VTAILIQTP 


2873 


16774 


A 


2892 


3 


342 


HENLI *IiAyTIAFIVKIPLYGLHIj*LPK 
AHVEAP lAGS I VIiAAVLLRLRG YGI IRL 
TLILNPLTKHIAYPF3:iELSL*GIIITSS 
rCLRQTDLKSLIAYSSISHlALWXAIL 
I 


2874 


16775 


A 


2893 


2 


361 


ARVCLRQTDLKSIiIAYSSISHIALWTA 
IIiIQTP*SFTGAVILIXAHGLTSSLLFC 
LANSNYERTHSRI I ILSQGLQTLLPLIA 
F* *LIASrANLALPPTINLLGELSVLVT 
TFS*SNIT 


2875 


16776 


A 


2894 


229 


3 


YTF VQS F I FS WIjF I CKVCLL * PAHSMV 
CACECMCVCVCLCVCnSSLKI IPLiLECIi 
I YLQL I L VLSSAI YNSRFE 


2876 


16777 


A 


2895 


3 


391 


GFL I TNN I S PAS P FQTT I PLYLKLTALA 
DTFLGLLTALDLNYLTNKLKIKSPLCTF 
YFSNILGFYPS ITHRTIP YLGLIiTSQNL 
PLLLLDLT^LEKLLPKTISQHQISTSII 
TSTQKGI I KLYFLS FFF 


2877 


16778 


A 


2896 


381 


2 


GRCRVSGS I IDHRMAPQKWR YTKGGPHQ 
WHNFCFF*KKGVFPYGPKVFLFRAPVFS 
PPGPPKRWSIKV* TPPPALFFFFFFFFV 
EMGSHCVAQAGLKL.LGSSTPPTSASQGA 
E I AGVSHRAQPHAS 


2878 


16779 


A 


2897 


324 


1 


LYTISrNTKHINPQQiqiSrTK*YQRNTKNRRK 
KEQQTKNNIKRHITESARKNS* *HRNIQ 

KANKSQQSNKHYRARSRHNisrisnsnsnsnsn^^ 

KKKKIENSHAASD* ITSSGGRSRA 


2879 


16780 


A 


2898 


120 


345 


PPAPSS*YTTSLIQDRLFL.IVIMAVLSSAS 
LMRGNVGSNIMNALSHFLPQGTATLFIP 
VLD I ME AI S LiL I QP I ALAV 


2880 


16781 


A 


2899 


381 


219 


CVSQDGLDIjLiTS * STRLSLPKSWDYRCE 
PPRPAGSGHLYSSFKKQTGDTHHNF 


2881 


16782 


A 


2900 


1 


224 


LSWCIiTLYFAYLLALFYFYPLKILDLAI 
LPRLVPYSWP*ASSPASASQSSGirGMS 
HYTWPLPQTLNPFLFFVP 


2882 


16783 


A 


2901 


1 


228 


KYrjINNRLITTQQ*LrKIiTSKQMiriHlSr 
T * GY* YNRSGSS FNGVYDYHLLRSESEH 
PWMIVDNTEYDEIYTRGGIE 


2883 


16784 


A 


2902 


2 


332 


LPPSFSLLLAPSPSSSLPPSPSSSLPPS 
PSSSLPPSPSSSIiHTSHSSTLTPSASCT 
riTITESFTQRLAQSYITILP*ASRTTMQ 
LLKYRTLQTTRNNTHSMTRPKALTKI 


2884 


16785 


A 


2903 


3 


243 


DIGFGTDFSDMTPKAQTKQVDQLDFIKI 
*NFWPGTVAHTCNPSTLGGRGM* ITKLG 
V*DQPGQRGETTQKLAGHGGVDAV 


2885 


16786 


A 


2904 


179 


2 


CGYFNAYI CYSLLCFLYIiSLCNQPFWKK 
I FLPLFGLVFFFFFF *NRV* FCCPGWSA 
VSQ 


2886 


16787 


A 


2905 


108 


2 


GRVDIKLTSKQMr'riHNTKGRT*SriIIiV 

SlillFIA 


2887 


16788 


A 


2906 


318 


2 


y KE SNS KNNE KKK I TRQRKRQ KKNRP SM 
MVHT CNP S PMAGRDGQ I PS I LFNQPHS P 
RI IRLTL.ILNPLTKHIAyPFLVLiSIi*6I 
I ITS S IGLRQTDLQSLIAYSS 


2888 


16789 


A 


2907 


17 


308 


KLAGYGGMCLWSKL.LERPTRMNHLSPRS 
EGLMEP*SHlSrC I PDWTP* QNPVSVQI TG 
FCFLTPNLAMPPRLNLSGPICCQLSLRL 
LGPRLFAASVSGV 


2889 


16790 


A 


2908 


378 


1 


RSLHSTETAAKTHGHQTHPGIiSTPSK*T 
PTP IAS IPI PNPHWCLPAFNS TTSAS YF 
SBCVLTSSFKEI lADFR* KYSSTN*HPY 
HLFLLKFVFFCSRDGSLTMLP IRPGLNS 
RSQAIIiPPWPPKVXi 


2890 


16791 


A 


2909 


1 


129 


HLFGTNHRDIGTIiCLLFGA*AGVLGTAL 
SIiIiIRAELGQPS TRP 


2891 


16792 


A 


2910 


1 


323 


CVCWGYRIPRCNIGHHLKFFFFSLFFFF 
FGNKVWFCSEG*RAGDQLLIMEPrAS6F 
KGI FFLNLPQNWE *RVSPKIiPGKFWlF* 
LKTGFPLVAQWFBriRTSGDQMA 


2892 


16793 


A 


2911 


367 


136 


ETPGNHLSPGV*GCSEIi* SCLCTPAWAT 
EQD P r S KQ KRTKKS KTLLKNTKADIiTRW 
KHIHKSACVCICLCMYMCLRE 


2893 


16794 


A 


2912 


258 


140 


FLIFFCLDCFLNIKVLGVPGFFFSLKKK 
QT * RQGLALS PRLE CSGL 1 1 AHCTLE PL 
GSRNPLTIiAPQIAGATGMSH*VQGYNEL 


2894 


16795 


A 


2913 


48 


242 


y SMI F CVL I LP VS FTSSDGI FLLINLFL 
FY* STYPLAFFKTESHSVTQAGVRWHHL 
GSLQPPPPR 


2895 


16796 


A 


2914 


309 


76 


GFI ICVCVCVCVCVCVCWFIFTEYSFH 
SLFC* IRCCVFYFLALSLLIGFYLFWNR 
YLEVYNISFEVWASLFNRFLLL 


2896 


16797 


A 


2915 


1 


122 


FSVETGFHHVGQPGL*PPTSGDPPSLAS 

QSAGITGMSHHA 


2897 


16798 


A 


2916 


167 


1 


GNMCSKESVSGTNINRKPD* * YNPRLGT 
VAHAYNPSTLGGQGRWITGGQEPKTSL 


2898 


16799 


A 


2917 


2 


356 


ARGILLLRIMLTTIjTRYP* *RDVTREST 
YQGHHTPPVQKGLRYGI ILFITSEAFFL 
PRFLPPFLSSSFPPTPSLLKPFPPSSFP 
SLPSFFPPFLPSSLFL 


2899 


16800 


A 


2918 


2 


351 


AREYTSLQLILQMTFIMAFTCTDRTLYE 
LAFEC*LMRTLDirTR*GNQP*RPNAGS 
YWLPYTLLGTLPLLIALIYTHNTIjGSIiN 
ILLLTLTAQELSNS*ANNLI*LAYTIAF 
IVKI 


2900 


16801 


A 


2919 


32 


321 


AliMGITFFFFLGKGVLAPRGGGRGGNPG 
LWGGPPPGLGPFSG*SLQGGGA*GPPPQ 
AGAWFGFLjILKKTRGSPGCPGGFWXPGP 
GDRPAGAPQGGG 


2901 


16802 


A 


2920 


1 


361 


GTSTRLGVLLLSLHHAGS INPLGITLHS 
DKVTIiHPYYTIiCDALGriLIiFLIiSLMTIiT 
LFSPDLLGDPDNYTLANPLNTPPHIKPE 
*YFLFAYTILRSVPNKIiGGVLALLLSIL 
ILAIIPIL 


2902 


16803 


A 


2921 


3 


152 


HERLYSVPLRILRRLPDPLS I PQGWRHV 
IRESAYKSHLTLPARRGLQDGIILFITS
EWFFAGFF*AFYHSSLCPTPQL*GHWP 
PTGITPLNPPESPLUSTTSVLLASGWIT 
*AHTKAISHYLREEAFKTG 


2903 


16804 


A 


2922 


410 


1 


RAGYRQKWGSLATVIGQLGLPVERGWYG 
PSGEGGTPQRHCSELVLDDLHVITQNRA 
NHRPRCGGSLLSELKFAPIiALQPGRQSK 
TLSQKKNKQTTTKKNTKKQPGSVAHACN 
PSTLGG*GRQITLPSGVRDQPGQH 


2904 


16805 


A 


2923 


2 


145 


ARAARGDVI * S FHVFAQRDLNLLSSRDL 
PALASQSTGITGMSHQCPG 


2905 


16806 


A 


2924 


29 


340 


EMNEVERRGRK*LFHDSIQSR*EYRCAP 
QCPANS CVLM*RWGFAMLPKIiVSSDLPA 
LASQSAGITGVSHCAWLVFLPLLSTSFI 
SQRHLYLPRA 


2906 


16807 


A 


2925 


2 


337 


ARVLILPGFGIISHIVTYYSGKKEPFGY 
I GMD * AMI S IGFLGF I V* AHHI FTVGI D 
VDTRAYFTCAT 1 1 1 AI PTGVKVFN* IiAR 
LHGCNM I * S AAVL * S LGF I FVF I VSGQA 


2907 


16808 


A 


2926 


76 


388 


RYCTPAWATETPSQKKEKKKKCQTFEPE 
SSFPVCLEMGGGGNFISLFFLFLIILSL 
FFYFY* FFFI IFVYSYYYFYFCLPSYFF 
FYFFINLIIFLLLNYFFFFI 


2908 


16809 


A 


2927 


48 


374 


KDQEPTDMGSAHFQVFKGWRQVGGANED 
RINSHGGPVMFSGQYYGQLVRFLVY* I I 
IFFMLLFFSLFYSYFLFISIFFVLVFYC 
LFIFLFYFSFFFIFVFFFYYFIVXF 


2909 


16810 


A 


2928 


3 


294 


HEGFHYI S QDGLDLLTS * S ARLGLPKCW 
DYRREPPRPARFPTLRIWLYILKALiYTV 
LGCYREYEAYRPWCWTHRMDGGQEVEGR 
REGTGGGHILGF 


2910 


16811 


A 


2929 


7 


420 


IRRRDVLLLTLTSLSPTAARICYNGGRR 
GNRIiNSIiAYLSKEIiLAAWSLRKPSHGLL 
TPIRCVLYXRGHHF*IjLPPSHLLLLASA 
IMGAAGGTG*TV*PTLARNYSQPGVCVN 
lAMVSLHLSGVSS ILGAITFITAIMNME 
APAITQYQTPLFV*YGLITADLLVLSLP 
DLTAGITIIjIiTDRKLSTTFFDPAGGGDP 
ILY 


2911 


16812 


A 


2930 


11 


253 


GLLHKAPSP *KFFFS PKPFNFFWKFSP I 
FSPPKKKFLSKNPHIVFKFPPFKGKIFT 
FPPPLKFGPPRVFFKAPPPFFPPFFFFF 
PPFFF 


2912 


16813 


A 


2931 


70 


341 


DSSFFS*IiATIjHGSNMK*SAAVL*ALiGF 
IFLFTVCGLTDIALPNSSLDIALLDTYY 
WAHFHYVLSLGAVPAIIRGLIH* FPLP 
SGYTLDQS*AKIHCAIIFIGVYITFFPH 
HSLGL 


2913 


16814 


A 


2932 


3 


340 


NYSHPGASVNLIIFTVHLAGVSCILRAI 
NFITTIINrKPPAITQYQTPLFV*SVLI 
TGVLLVLSIPVLSAGrTILIiTDRNLNTT 
FFDPAGGGDHILYQLLF*LFGHPEAYIL 


2914 


16815 


A 


2933 


2 


364 


RVQKGLRYWIILSITSEVLLFAGYF*AL 
YHSSLAPTPQLGGHWPPSGITPLNPLEV 
PLLYT YVLLASRVS FT * AHHRL I QNNRN 
HI IQALL I T I LLGL YCTLLQAS *NFEP P 
FTISDGIYG 


2915 


16816 


A 


2934 


2 


367 


PRVRPRVRYL,LFGA*AGVLGTALSLLIR 
AELGQPGNLLGNDH r YNVIGTAHAFVI I 
FFIVI PI I IGGFGN*LGPLI IGAPDMAF 
PRIKNISF*LLPPSLLiLLIiASAIVEAGA 
RTG*TVYPPL 


2916 


16817 


A 


2935 


3 


365 


YHIV*PNP*PLTWALSALLMTSGLTM*F 
HFHS I TLVILGLLTNTIiT I YQ * WRDVSR 
* STyQGHHTPPVQKGLRYGIMLFIT*DR 
FFFAGVF*ALYHSSIiAPTPHLGGHWPPT 
GITPLNPLE 


2917 


16818 


A 


2936 


3 


144 


DSHGRHVING*TCVTICFIRQLIGHFTS 
KHHPGFEAAA* YWHKKKKK 


2918 


16819 


A 


2937 


3 


422 


QRLLATNHHDIGTLYLSFGA* PGALGTD 
LSLLIRAELjGQPRDLIiGNDHIYNEI*TG 
HALGIIiFFIGLPI IIEGFGN*LHALI IG 
APDiyiALPRINNISF*IiLPPYLLLLiLASA 

IVEAGAGTG*TVYPPLAGNYSHPGASVE 


2919 


16820 


A 


2938 


3 


382 


RTRGLFSTNHIDILTLYLLFGA^AGVLG 
AALSLLIRAELGQPCNLiLCNDHIYNVIV 
TAHAF VI IFFIVIPIII* CFGN * LLPL I 
IGAPDMAFPRI]SINISL*LLPTSIjLLLIiA 
SAILEA*SGTG*TV 


2920 


16821 


A 


2939 


225 


3 


NISWRMKQIVPKGEPYNGVPVFVSLWFH
KMFIFETGSHSVYQAGVQ*RHLGSLHPH
PPGIKQFYIiSLPSSWDY 


2921 


16822 


A 


2940 


1 


247 


PTRPPVIYSTIFAGTLITALSSH*FFT* 
VGLEINMLAFIPVLTKKINPRSTEAAIK
YFLTQATASIILLIAILFNNIKKKKK


2922 


16823 


A 


2941 


3 


369 


TRDSTYLGHHTPPVHTGLRYWRILFITS 
DAFFFAGFF*AFYHSSLAPTPRLGGHWP 
PTGITPLNPLEVPLLYTSVLLASGVSVT 
* AHHS LVEDDRYQ 1 1 QALL I T I LLGL YF 
TLLQASEDFE 


2923 


16824 


A 


2942 


3 


401 


LTVCCVIVLRLKTLFFFFNKPFLTQKGY 
FNTPEEGFFKKPNRRVGPPSPMTDPTML 
TNL I KGKVPKAP PRI IjIVIGGGINMTF*GF 
VTPRAPFPLTLRLNPMLQQGIDLLTLKA 
SGGSPASGNFLKELGLRSINF 


2924 


16825 


A 


2943 


189 


348 


RNARGWVTDKEKRLRLGMVAHACarSSTL 
GGDGR* IA*GQEFETSIiANMVKPCL 


2925 


16826 


A 


2944 


2 


349 


ANSNYERTHSRI I IIiSQGLQTI.IiPLIAF 
* *LVASLANLAriPPTINLLGELS VLVTT 
FS*SNITLLLTGLI ILVTALYSLYIFTT 
SQWGSLTHHIimiKPSFTRENTLMPILL 
SPIL 


2926 


16827 


A 


2945 


621 


929 


GCSSGTGCCPILCDLPRPWSCRGVGGSP
SSTAHLCPRGWRSGRCFLPPLSAS*VDS 
AMStilQAAKNLMNAWQTVKASYVASTK 
YQKSQGMASLNLPAVSWKM 


2927 


16828 


A 


2946 


379 


395 


S QHFGRLRREDHE VRS S * P RDP PALTS Q 
SAGITGMSHCARPLVATSIHKIDNRYID 
DRS*NINIGTIFINLLIFYLSIYLSIYL 
SIYLSIMYSFTVAQAGVQWRDYGSLQPE 
SPGPK*SSGISLSSSRNHSHTPPHGWVD 
PKIP 


2928 


16829 


A 


2947 


2 


357 


HTYDIRKPRR^TLACAIiSALLTTSGLAM 
TDCFH Y I TliL I LCLLTNTLT I YQ * WRDE 
TRESTYQGHHTPPVQKGLRYGI ILFITS 
EAYFFAGFF* AFYHS SIjAPTPQLGGHWP
PTGITP 


2929 


16830 


A 


2948 


2 


337 


PRVRYSTDHSDIGTLYLLFGA*AGVLGT 
AISLLIRAELGHPGNLLGNDHIYNVIGT 
AHAFVIIFFIVIPIIIGGCGN*LGPLII 
GAPDMAFPRIWNISF*LLPASVLLLLAS
AIVEAG 


2930 


16831 


A 


2949 


2 


217 


KNLKIVQYGDMCLWSQLLRELRWEDRLS 
SGG * 6CSBLCS CHCTPAWTTQQDCIjKKK 
KNFWPVGEVKGRHML 


2931 


16832 


A 


2950 


2 


354 


AIPMTTARLTIHEAYIillLERTTTTTKD 
VKNPRRIAAAITASCLGGGLEDAISCQY 
GIATKDRPTGLGTPEVLLGALPGAGGTQ 
RLPKMVWPAALDMRLTGRS IRAHRAIiK 
M*LVD 


2932 


16833 


A 


2951 


3 


150 


LFSCSPTFSSDPIiTTPLLILTT*LIjPLT 
IMASQRHIiSSEPIiSRKKKKKK 


2933 


16834 


A 


2952 


283 


344 


EKKKKKKKKKKKKKKKKKKK* KLRKKKK 
KKKKKKKKKKKKKKEK 


2934 


16835 


A 


2953 


3 


378 


DAWADAWNQTPLFN*SVLITAVIiLLLSL 
RVLAAGITILLTDRNLNTTLFDPAGGGH 
PILYQHLF*CFGHPEVYILILPGLGIIY 
RIVTDYYGTKEPFGYIGMV*AMRSIGFL
RFIVRAHHIVTVG 


2935 


16836 


A 


2954 


3 


387 


ILYQHLY*FFGHLEGYILILPGFGIISH
MVTYYSGKEEPFGYIGMG*AMISIGFLG
PIGRAHHIFTVGIDVDTRAYFTYATIII
AIPTGVKVFS*LATLHGSNMK*SAAVL*
ALGFIFLFTVCGLTGI 


2936 


16837 


A 


2955 


2 


432 


PRVRPRVRKHIAYP FLVLSL* GI IITS S 
ICLRQTDLKSLIAYSSISHIAIiWTAIL 
I QTP * S FTGAVI L 1 1 AHGLTS S LLFCLA 
NSNYERTHSRIIILSQGLQTLIiPLKKKK 
KKKKKKKKKKKKGGGLLKESLGGPNLTG 
EGK 


2937 


16838 


A 


2956 


12 


391 


SYFISSSKPHLSPPWLSSPDEATSKKKK
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGFK*KRIGGSFKKGGGEKKIFFFKG 
GKKKKTGGFFEKKFFLGGEKY* DNLPKK 
NKTVGEKKNFLGGG 


2938 


16839 


A 


2957 


1 


352 


PTRPYFPVDAGEAQHHPRTCRRPLRALW 
SSHHERWKVTLCTHCSLGVFFLYCCTYY 
IFVLFIP*SSCGLTIjIFITCIILFGSIS 

PFLFFTIVFSIIIVTTFKFRLLYSIIFL
SYLLC 


2939 


16840 


A 


2958 


1 


243 


NLPRLNPKEIE IIjNRSI I CNKI KAI I KS 
LIiSKKSSGHNDVTASFY*TFKEEI:iITIL 
LKFYIiTPKKKKKKKKKKKKKKGGPF 


2940 


16841 


A 


2959 


373 


3 


FSSLKKRVTPPPPPKTGFSLEGIiHLLKK 
NFPQKPPPPKKSFSQKNPPPPKKKPPF* 
KKKPPPPPPI*HPPPKILQPPPPPFFFF 
FFFPFFFFFFFFFFFFFFFFFFFFTDMR 
GFIVRKFRTRG 


2941 


16842 


A 


2960 


2 


136 


PRVRSTIiPISYKlArNNRALMKAHIiIMK*F 
TEYFKTTDELYLHDNT 


2942 


16843 


A 


2962 


1 


358 


HTPPVRKGL* YGI ILFITSKVFFVPRFI 
* AVYHS SLCPTLQLGGLWPATGMTWLNP 
L*VPIiCTTFVLIiA* *VSIT*AHHNLIDL 
NRCHVIYALVITILIiRIiYFTLIiPASEYF
EAPLTIS 


2943 


16844 


A 


2963 


2 


373 


RAYDIVRPRP*PLLGALSALLMTYGLAM
*VHWDCITLLILGLLTNTLAIYQ*WRDV 
SRESTYQGHHTPPVQKGLRYGIILFITS 
KDVFFAGIV*AFYHYSLAPTPQLGGHWP
PTGITPLNPLEV 


2944 


16845 


A 


2964 


403 


61 


LFFPLEKNFLPPGFSAFPSPFSP*KFFF 
SPKALIFWGNFSPFFPPPKIRFLPKIPP 
WVFFSPPFWEKLFSSPPPLNFGPPRVLF 
KGPPLFFFFFFFFFFFFFFFFRFEGBCW 
RL 


2945 


16846 


A 


2965 


2 


360 


LIPNLAMVTR*GNHPQRLNAGTYrjLFDT 
LDGSLALLIGLTYTRNTLGSLNVILLTI 
TAQELSNS * AISTNLV* LAY TLAP I EKI PI* 
YGLHL* LPKAHREAPIAGS I VIiAAVLS K 
LGGSGKN 


2946 


16847 


A 


2966 


321 


2 


STGMHFPHINMAINPPPPPRPPPPIiFPP 
NPKTKPNPTQ*KGGFPPLGPF* KKLSPF 
LTLFPFFFFFPFGHHPGPWQKKKKKKVP 
FFCVGFSFQRVHNIYKNTHQQQ 


2947 


16848 


A 


2967 


200 


1 


RRTYTSHLLACLRQGLAFSPRLECGGKI 
RAHCSLQLYGSSDPPT*APQTAGTKQHN 

QRIAQCNADN 


2948 


16849 


A 


2968 


2 


354 


LLTASSSEIAPLQSSLGDRARLCLIK*K 
EGVLlSrs I * SGNQGKS YANVYRLL YLDP I 
PKIYAEAYTP*NVNSTNLETKSPKTIQK 
FPEDREFKNDFFEKTKNGDRETGAFHPF 
LLFLV 


2949 


16850 


A 


2969 


99 


400 


ALGIHFIFTVSGLTGIGLANSSLHIVLH 
DTYYWAHFHYVLSLGAAFAI IGGFIH* 
I PL YS C YTLDQT YAKI HFT 1 1 F I GVNLT 
FFPQHFLGLSGMPRRY 


2950 


16851 


A 


2970 


2 


377 


NILLLTLTAQELSNS*ANNLI*IiAYTIA 
FIVKIPLYGLHL*LPKAHVEAPIAGSIV
LAAVLLKLGGYGIIRLTLILNPLTKHIA
YPFLVLSL*GIIITSSICLKKKKKKKKK 

KKKKKKKKRGGPF 


2951 


16852


A 


2971 


1 


407 


GTRSYTHLYRVFLELIKMSVYDLNHTVI 
MVISGHVRLAFYGIVHLTLILNLLTDHI
LYPFLVLSLSGVIIISSIFLRRTDLESL
IAYSSISHIALWTAILIQTP*SFTGAD
ILIIVHGLTSSLLCCLANSNYER 


2952 


16853 


A 


2972 


10 


259 


SRSVAIYFKGMASA* *RMFSSKKKKKKK 
KKKKKKKKKKKKKKTAITKKTTTAKWKN 
*RTEKIRHTRLPLISSRDAKCVDFLyT 


2953 


16854 


A 


2973 


12 


400 


LNCRTPSLYLQRAGELLSVENPHIWCQK 
CVRKNWFFLFFFFFFKTDLYCPHICIAL 
S ITCFLAI SISGLFAFFLIiNSFNYHFI I 
VP*NFSLS lYLCLHSLPYVGFFFSFDMM 
IFSILT*TLFPCLLFHL 


2954 


16855 


A 


2974 


125 


3 


RPRRPAAQVGVQ*KNLSSLQPVPPGPKL 
FFPLSLIiSSWAS 


2955 


16856 


A 


2975 


2 


332 


RECTNYPAQPPPPPLEAEEGFQPLPPTD 
TG*LECLPPLGGGQHRALGLGATWHLEN 
ACALVLADLGTGPKPS6PFSGQACGASG 
RLPLLSGPILPPLGQGHTLSAWGHHA 


2956 


16857 


A 


2976 


7 


329 


SRDILViyiTEDGEFF*GHV*P*DRSRRAD 
TPRLGGHWPPTGITPIiNPIi * VPLLNTS V
LLASGGSIT*THHRIjIEYIRHQIIHALL 
ITILLGLYFTLLQASEYFESSFT 


2957 


16858 


A 


2977 


2 


339 


AHHIFTARIDVDTRAYFTSATI r XAI PT 
GDKAFS*riATLHGSNMK*SAAVL*ALGF 
IFLFTVRGLTGIVIiADSSLDIVLHDTYY 
VGAHFHYGIiSIGAVFAIIGGFIH*CPLF 


2958 


16859 


A 


2978 


394 


0 


QGCSEL * S CHCTPAWVTE * FP VS KKKKQ 
RK 


2959 


16860 


A 


2979 


149 


383 


PIjCFPSQHWGKSSNCIiSSLKYFPSSGLP 
LRCKAELDDVKQKADKELDWAHACNPS 
TLVGQGGRITLRSGV*DQPGQH 


2960 


16861 


A 


2980 


2 


378 


ARVS IGFLGLI VGAHDMFTGRMDLYTRT 
YCTFATILIAIHTGVPVFI *IiATIjHGClSr 
MK*SAAVL* ALRF I FISTVCGLTGIVLP 
N* SLDI VLHYMYYAVAHLHYVLS 1 GAVF 
AriiGRFIHRLPVF 


2961 


16862 


A 


2982 


2 


409 


PAVAEAYLKPWDDSKGSFLWGKPDLDG 

IRECCQRNFGWNRTRTDESLPPVLKQLD 
AQQTQLRIDSFFTIoAQQEKDDAKRIKSQ 
RIjNRAVTCMLR I E I E AAS SEX E AVS VD * 
QKELEIjIDKAVACHIiEYMYETDP 


2962 


16863 


A 


2983 


3 


453 


HASAHASAHASGQRKGAAPAEKKCGAEA 
QHEGLELRVENLQAVQTDFSSDPLQKW 
CFNHDNTLLATGGTDGYVRVWKVPMLEK 
VLESKAHDGEIEDLVLVPEGML* IVCPD 
H YS P VG I NDFMWLL * LNWGNRL Y PPHI S 
VYLMFHFRTF 


2963 


16864 


A 


2984 


3 


417 


LILPGFGIISHIVTyYSGKKEGFGYICM 
V* AMI S XGFIiGL I VRAHHI FTVGIDVYS 
RAYFTSATI I lAI PTGVKVFS * liSTLHG 
SNMK*SAAVL*ALRFIFLFTVSGLTGIV 
IiANS S LD I ALHDT YYV* THFH YVLS I 


2964 


16865 


A 


2985 


273 


408 


MKIFIF*VSPPLFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFLiAVFTFI 


2965 


16866 


A 


2986 


176 


335 


TFQPSELWRTALLPRLECSGLI lAHCSL 
ELLGSSSPLASAS * lAGTTGS ILRY 


2966 


16867 


A 


2987 


65 


398 


KKEFKIGRKAAEKMTRNINNAFGPGTAN
ECTVQWWFKKFCKGNKSLEDEECSGRPIi 
EDDNDQWRAIIEADPLTTTREVAEEIiSV 
Z»!rHSMAVQHLK*VGKVKKLNKWPHEnS 


2967 


16868 


A 


2988 


2 


158 


PGWSIjTPDLR^STCIiSLPKCWDYRHKRP 
SPTSRQTFDHAPNKNSSHSPP I YM 


2968 


16869 


A 


2989 


435 


23 


GWPPNPQKSFYFPQRLKIWGGGGRKRP 
PPKKKGFSQKTPEGF*KPPPKRRKKIFH 
DPGKKGPPKGIFKRGPPLFFFFYFFFFF 
PFFFFFFI ITIECLHSRFPHRHHNKKFP 
PNPPSPRFWPQHIiNTSLPNPKNKEP 


2969 


16870 


A 


2990 


410 


3 


GGRGHFPSGAFFIKFPWNKKGISQPLCF 
PRGGGS PPPLGPVRGGGPPCWGPLCHKG 
PVKKTGAPRGKNGISPFFCFPPLGSFNR 
SGFFWALLGSPPrjFFFFFF*DRVSLHHP 
GWNAVAQS*LTKTSTSWVQVILL 


2970 


16871 


A 


2991 


371 


132 


KQSLPFKVKGFFFPPEVENAIYPDSHTA 
FYSW* KRSTPSKKKKKKKKKERKRKKEE 
RKEGRKEGRKERKKEGKKS lENA 


2971 


16872 


A 


2992 


336 


1 


CPRWSRTAGLK* STGFGLPKCWD3SIRHES 
P CPAS S VLiND * QLST VQRAT * WQTKS 
LKAPPARGAIiPVSRAVSPPPSSHHFAYF
IiHMI *QVSCLFLR* SIjALVPQAGVQ*RD 


2972 


16873 


A 


2993 


317 


440 


HLGRLRQEHHLS PG* LRKFFFFFGRDGV 

LMLSMLVFNSWTQVMLiLPQPPQVIjGLR 


2973 


16874 


A 


2994 


2 


181 


FHHVGLDIiLTL*SACLGLPKCWDyRREP 
LCPALVILIiTAKFTNFRYRVKNVCTPHV 
ESN 


2974 


16875 


A 


2995 


1 


165 


GFHCISRDGLDLLTS*YARLGLSNCWDY 
RHEPPSPAPLFISYSICLFLSKLIQFL 


2975 


16876 


A 


2996 


405 


2 


KKGFPIFSRMFFLYKHIEKPPIiASQKFG 
DPKWS PHPRP I FFFIiLKKGVLYVWREGF 
KFHPP*FPPPGPPKRWE*RVKPPHPPPF 
PPFFFLYVEMGSHCVAQAGLKLLGSSTP 
PTS ASQGAE lAGVSHRAQPHAS 


2976 


16877 


A 


2997 


2 


415 


CLFTGGGLTGIGSAGSSLDIVLHDTYYV 
VAHFHYDL S IGAGFAI IGGL IH* FPLFS 
GYALDQTYARIHFTI IFIGVNLTFLPQH 
FLGLSGMPRRCSD* PDAYTT*SMLSSVG 
SVITLTAVIVLI IMIREAFDSKRTVL 


2977 


16878 


A 


2998 


349 


370 


HHFY* S ITAFRPFQHMELRNFFFFFLFF 
FFFFFFFFVFLFIFIaFLFYFLFLYEIYIj 
ILFFL^HHFYSAV 


2978 


16879 


A 


2999 


36 


139 


LCHCTPAWETFSKEKKKKEKRKKKKKKK 
KKKKKKKPGGGS * KTALSHDCATVLQPG 
RLiFQKKKKKRKKGKKKKRKKKKKKKK 


2979 


16880 


A 


3000 


104 


373 


SADREXXSKTDNLLGH* TlvnnSTKCKVPRV 
I QSVFS SHSGLKLE 1 NNRKMKEKSLiNT * 
KINNISYWVQCSLYHLKSNSYTKSSDNT 
TTQYMY 


2980 


16881 


A 


3001 


1 


396 


LDCSKISSYLQKSSSHVLFFSFSFFFFF 
GGGGGFAIiENPYPPAGLGPQKKKTLSPP 
WLVGGPPFPEKAPP*GGHFLGKKPQKNP 
GGQNPKRPSGEGNPFFQPPGGEKKQIGP 
PTGFGGKPFFSF* PREPPGP 


2981 


16882 


A 


3002 


412 


104 


FPFFFLPVRQTFYPQPFSVFFPLFPFKF 
FFFP*AFNPFWGFFPIFSPPKFGPFSKI 
SRLVFPSLPFWEKFPFPLPPFFPAPIiRF 
FPKGPPXFFFFFFFPFXXY 


2982 


16883 


A 


3003 


400 


46 


LFFFFFYKIiFPPPAFGGFFPPFPIi*NFF 
PPPGPFFFLGGFPPFFPPPK*VFFPKIP 
PGFFFP?PF*KKFFFSPPPFFI.PPPGFF 
PNPPPPFPFPFFPPPFFFPFFFFPLCW 
VQVEAY 


2983 


16884 


A 


3004 


411 


69 


YSPLPPLPFSSPPSKFPWPPFSLFFLTR 
VYKGFFFFPFFPFELFGPPGLSLGSKPP 
PVFLGGDPAFFSIPHPRVGSLPPPPNWP 
FIGPSFW*ALFPAPPLFFFFFLSSFFFF 
F 


2984 


16885 


A 


3005 


3 


200 


DAWG*LFSTNHKDIGTLYIjLFGA*AGVIi 
GTALSLIiIRAELGQPGNXiLGNDHIYNVI 
VTAHAFVri 


2985 


16886 


A 


3006 


259 


146 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF
FFGLLNDILL


2986 


16887 


A 


3007 


118 


414 


QNQPPQNKATHTVKXEKKEKPETKTVAK 
EHNKAKTAEKSEE * TKKEVKGGKQSKVN 
HTAAKVKEVQKTPSKPKEKEDNKKAAVS 
KHEQKDQYAFLRYMI 


2987 


16888 


A 


3008 


427 


111 


FFFFSPVGNFSPPQQFPFFFPPFPPKIF 
FFPPPL*FFWGGFPPFPPPPKKVFFPKS
PPGFFFPPP*GKKFIFPPPP*FCPPPGF 
FLSPPPPFFPFFFFFFFFFPS 


2988 


16889 


A 


3009 


3 


302 


SLAS CLSY LVCVI FLGQPKPT I * LQNS T 
PHKIC*NPTERyVKTCTQIFIALLFKKEK 
QPRCPSAGEWINKMLYACTIEYWLAIKR 
YEILiyATV*MyLEKI 


2989 


16890 


A 


3010 


1 


409 


RLHDATFPIIEEIiITFHDHAIiVITFLIW 
LLVLCALFLTLTTKLTNTNISHAQEIET 
V*TI liAAI IIiALMVTiPSLRILYI TDEDN 
DPSLTXKSIGHQWY^TYEYTDYGGLMPN 
SYILAPLPLEPGDLRLLDVDWRW 


2990 


16891 


A 


3011 


157 


2 


GRVDLKIQKLARCGGACLQSQHTQQNHL 
NP6EKGCSES*IiPPCPPD*VTKQ 


2991 


16892 


A 


3012 


2 


423 


ARAARAHIVTYYSGKKEPFGYIGiyiG*Aiyi 
ISIGFLGFIGRAHHIFT* * ID6HTRAYF 
TSAT 1 1 1 AI P TG VKVFI * LATLHGSNMK 
* SAAVL * ALGFI FLPTE CGLTGI VLAtSTS 
SLYI VLHDTYYWAHFHYVLS IGAVFAI 


2992 


16893 


A 


3013 


2 


140 


ARAN I LLLTIiTAQ ELWDPRANNLi I * LAY 
TLAFIVKKPLYGLHL*LPKAHVETPMDG 
PILLAAKLIiKLGGSGIIRLTLilLNPMTK 
HI AYPLLGLSL* G 1 1 ITRS I CLRQTELK 
WLIAYS*ISHIALWTDILIQTP*SKHF 
TTNSHCPRTMGPQSQQLNMTSLHTSFYS 
KETSIiRTPLMTP 


2993 


16894 


A 


3014 


2 


420 


PVIiA?^GI S I LLTDRDLLTTLFD PGGGGD 
PILYQHLiF* FFGHPDDYIIilLPGFGI X S 
HIVTDYYGRKEPSGYIGMV*AiyiTPVGFL 
GFIE *AHHIFTVGIDAHTRAYLTSVSIL 
I VI PTWRQVFS * liATLHGSNMT * YAALF 


2994 


16895


A 


3015 


6 


292 


AHHIFTV*irVYTRAYFTSATIVIAIPT 
GVKVFS *IATLHGSNMK*SAAVL*ALGF 
I*LFTVGGLNGIV*SY*LLDIELHDTY* 
FVMGCPRKVYF 


2995 


16896 


A 


3016 


3 


422 


TPIIIGGFGN*LFPLIISAPDMEFPRIS 
NISL*LLPPSILLLLASAIVEAGT*TD* 
TVYPPLAGNYSHPGASVDLTIFSLHLTG 
VSSILGAINFITPIINIRPPAITQYQTP 
LCA* CDLMTAVIiLLLSLPDLAAGITILL 


2996 


16897 


A 


3017 


21 


490 


TPFPGRHLTMFSLHLAGGCSILGAINYI 
TniINXRPPAITQYQTPLFV*SVIirTAV 
LHLLSLPGLTAGVTILLADQNLNTTFFD 
PAGGGDPIIiYQHLF* FFGHPEVYXLILP 
GFGI ISHIVTYYCGEKBPFGYIGMC*AM 
ISIGFLGIIV*AHHrF 


2997 


16898 


A 


3018 


98 


402 


LRSQHSKSFQISGKPSQEEWPQISPDST 
DYIINT*HFNAQMLKNIY*HQPHGIiHDF 














KKASSSSSKKDPPGGGPT 


2998 


16899 


A 


3019 


399 


229 


PPPPGGGGPQGPPPPRGGFLPKSPGGVF 
YPPPRGGKIFSPPPGFFGPPRGFF*GAP 
P 


2999 


16900 


A 


3020 


2 


401 


S DAVIi * AIiGP I YIi VLSGGLTGI VIiADS F 
LDLELHDTYYVGAHFHYVLS IGAVFAI I 
GGFVD^FPLPSGYTLDQrYAEUTHPrilL 
IGANVTFLPQHFLGLSGMPRRYSDYPDA 
YTT*NILSSVGSFITIiAAAIiL 


3000 


16901 


A 


3021 


413 


67 


PPPPGKIFFKKTPKKKIFPPPQF*IFFP
PLPPKKFFFSPNP^FFLGGFSPFFPPQK 
KIFFPKIPPNFFFSPPLKKKIFFFPPP* 
IFPPPRFFLKPPPPFFFFFFFFFFFFFF 
FFF 


3001 


16902 


A 


3022 


2 


332 


LTLS I LS 1 1 AGS * GGLNQTQLRKILAYS 
SITHIG*IIAVLPYNPNITILNLTIYII 
LTTTAFni:iLNi:4lSrSSTTTriLLSRT*NKIiT 
*LTPLIPSTLLSLGGLPPLTGFLPKW 


3002 


16903 


A 


3023 


1 


412 


RGPPFFFFFCVFFFFFFFFFFFFNKGCR 
HSKRPFFFFKKKKKSSRPTRDRV*FyPK 
GWRSPLFLFFSPGGRGPFPLSHQVGFSIT 
EVLWFKKNLELFRGSPARKKKBCKKKHL 
E 


3003 


16904 


A 


3024 


279 


1 


liGRNTELWKSGKGMDILKTNCGKLANEP 
FRQPRVLGIGGEAPRAGSGPPSRAPPA* 
TPGPSSAGSWP* PPGTGRAPR6PAPSAP 
GARSPGRPG 


3004 


16905 


A 


3025 


151 


401 


KKPLGGPNLTGEGKKKFFSLKGGKKKPP 
GKFLKKTFFLGGEKMGKTPPKKLKP*GK 
KKI FKGKRGKKNPKTLAVKKFS KKKKK 


3005 


16906 


A 


3026 


416 


140 


YLSPLKKFFTPPPLRMFLPPNPLKNIFF 
P P QL KI F WGGWAQNS P P P KKGF FS KNP K 
RVFLPPPIRKRYNFPPHGKILAPPKNL* 
SAPPPIFF 


3006 


16907 


A 


3027 


379 


31 


PPPRRAGVFFPKNPKPKPPPPREGGRF* 
PFPPIiKF*FFPKPQNFLGGGGArPTPPP 
KRGLGQIPTERFNLSPPTQKRINFPPPG 
KGGPPPPLLKPPPPPPFFFFFFFFFFSIi 
NSFI 


3007 


16908 


A 


3028 


28 


420 


MQQTTMAHI FLCNKFANCAHVPRT*SET 
KPMSTPLQFD*TYKGEKSCKYAEHERTW 
KQ*CVFSLYQIIPT*EKTWKCNQCGTNF * 
NQFFKQTTHL*NHTRDNQICFSKIGLEY 
YYRITTRQHLLKLRTVC I L 


3008 


16909 


A 


3029 


1 


401 


LGNNGEAVSEKRKEKSQKEKSHNWVGF 
FFFFWGKPPFVPQAEGQGRNFC* PKPWP 
PGWGEFPGPAPRGGGNGRQKQPGRGNFG 
V*REKGGSMGGPRGA*NPGPKNPHPWPP 
QGPEITGGTTRPHPSGIFKKT 


3009 


16910 


A 


3030 


285 


397 


MFIKGDGLNKLRPGAAAYACKTPSTIiGG* 
AGRITRSGD 


3010 


16911 


A 


3031 


265 


2 


KKESSCIKATNSNSLFPLF*SVFPFFVF 
FFFFFFFFFFFFFFPFFFFFFFFFFFFF 
FFFFFFFFF*HFKSALLWVTDLGFIiNSD 
PRV 


3011 


16912 


A 


3032 


410 


49 


GFSPPPP*KFFFSPKPIjNFWGGGGPNFP 
PPKKRFFPKNPPGVFYSPPKKKKKFFSP 
PP*NLAPPKIFLKSPPPFFPFFFFFFFF 
FFLSNVSNGLTNMYILPCKDPSCPTTFP 
IIjGSLISL 


3012 


16913 


A 


3033 


177 


2 


VTPPFFFFFETRSHSVARTGV* * SDRCS 
DQPHPPSIK*SSHFSIiPSSWDYRSMPPH 
AS 


3013 


16914 


A 


3034 


75 


395 


MSYKHXXKKKKKKKKKKGGAP * KKPWGG 
PKLTRDGKKKPFSLKGAKKKPTWKFWKK 
TLILGGRKMGTTPPKKLKPLRKKKrFKG 
*RGKKHPKSLPVEKFASRGRIKK 


3014 


16915 


A 


3035 


400 


83 


KKEGPARWSPVTPPIiLEGPVGRSPQTR
NF*PTPPTHRKSLPS*KSKIDWRGIPLY 
PPPPRVKPKKSFNPGNRRFC*TKIFPCP 
S PWAP KTHPP FQKKKKKRVY K 


3015 


16916 


A 


3036 


3 


196 


DSiyiPQT*NKSFAKARKKKKKKKKK:KKKK 
KKKKKKKRGGPPLKKTLGGPKLPGGKKK 
NFFFFRGG 


3016 


16917 


A 


3037 


1 


411 


FCYDVCVESGCADYS I VX IMKKKKEK* K 
KKKKKKGGGPLKKNPGGAQNYPGVEKKl 
FSLKGGLKKTPRGNFEKKPYFGGGKNGA 
PPPQKITKPLGEKKKFKREKGGKKIiQFPW 
GKKISLPGFYLKKIYPPGRGFFNFS 


3017 


16918 


A 


3038 


3 


429 


NFFFKKPRGGNFFPPPKKGFFSPPSPLK 
FFFFPPPFFFFGGGGPHPPPPPKRFFFQ 
KPPRGFFPPPL*EK1TFFPPPPVFPAPPP 
VFFLTPPPP 


3018 


16919 


A 


3040 


1 


111 


IGLSGMPRRFSDYPDAYAT*NILSSVGS 
FiXIYLQQDNK 


3019 


16920 


A 


3041 


76 


967 


QLLKGGVSGVCPLLMFRCVRSFPIjLVGS 
WSSLASGVKPQTFAVSVTVLKAARLELP 
I PPRGLWSIASGVKLQTFAVSVTAHKS 
SVDPKNSGAQLASPSGSRTRAAGGAACQ 
SRCRVPALLSPWWDGTGRRGAGGGARR 
GGSGRTGAHGVGGRLRHGGLHVPSPAPW 
KGS * GLARNRAQRRWAGTAGGP S TPSAA 
AGPGAKSLTALCEQGWPAAPSAGPTKPT 
PTRNSSWPASVARSPGSRSCLSLHTSLQ 
AEGVGSSLGQPSKGLPQCSGGAEGLLKC 
RQSGSPGRGGTESERGL^GLPQCSGGAE 
GLLKCRQSGSPGRGGTESERGL 


3020 


16921 


A 


3042 


39 


141 


LS IRGLNI I IKRQRL*DWIKQQDSTLCC 

P*EIH 


3021 


16922 


A 


3043 


2 


405 


LFSTNHIYIGTLYLLFSTRAGVLGTALS 
LLIPAELGQPGNLLGNDHIYNAIVTAHA 
FVIIFFIERPXIIGGYGN*LGPLIIGAP 
DMAVPRINNITF*LLPPSLLIiLLASAIE 
EAGAGTG*TVYPPLAGNYSHPG 


3022 


16923 


A 


3044 


3 


134 


HLNPGGRGGS EPRLHHCTPS WATE * DS V 
S KKKKS RKGWTGLF I 


3023 


16924 


A 


3045 


44 


187 


DPRVRQYQTPLFV*SGLIFAGLLLLSLP 
TLGAGI TI LLTDRWLHTAVFDPDGGGDP 
IlaYQHLP^FFGHPEVYlLILPGFGIICH 
IVTYYSGKKEPFGYIGMV*AMISIGFLG 
F I AGAHH I FTGG I DVDTRAYFTS ATI I I 
AI PTGVKVFS * LRL IRPNLCRLTSP ISP 
NPRGRHHYTTNRPQPPHRRV 


3024 


16925 


A 


3046 


434 


40 


GAPPPPPGRFFFFLNPREDTFPPPPQKG
GPSPPPPPKFFFSPPALFFFGGGWPNSP 
PPQKNFFF*KPPEFFFFPPFIiKKKIFFP 
PPIjFFPPPQIFP*TPPPLFFFFFFFFFF 
PPFFFFPRGCKINFIVRGF 


3025 


16926 


A 


3047 


1 


399 


LPTGGGLTGIVLTNS SLD I VLHDT Y* W 
PHFHYGLS IGAGFAI IGGS IH* FPLFSG 
YTIiDQTYAKIHFTIIFIGVKIiTFFPQHF 
LGIiSGMPRRYSDYPDAYTT*NILSSVGS 
IjISLTAAILIIFMS*EAFASK 


3026 


16927 


A 


3048 


116 


367 


GASMILSSMIFLECIVGGFPVFSVYIiFK 
LQILRQSSTMCPVLFCPFEIRSCSVTQA 
GV* RRGHGSL * PQPPGLSHPSSRDHGHV


3027 


16928 


A 


3049 


6 


345 


SQLLRRLRKENCLNL.GGGGYSHQR*HQC 
PIAWATE*DYLKKKKKNFSLFIjKTGGFY 
PFLKKAPIiGTTLSPKNNLPLSPIiYKKRT 
GPWNWERKNFGKKKGGVIGLQGGKTI PK 


3028 


16929 


A 


3050 


261 


2 


NKKSPPVIJLWWKMGFFFKFAKRVVLSWK 
GGGGFFFFSWRRSFTFVAQAGMQWRNLS 
SLQPRLPDLR*SACnGPPDC*DyRREPQ 
YP 


3029 


16930 


A 


3051 


160 


2 


ICVDEQAGVQWRYLGSLQAPPPGIiATLS 
CLSLMSSWECRQPPPLG*FFVCPR 


3030 


16931 


A 


3052 


89 


3 


PLTSGLLLLT*QKLAPISIIYQISPSIiN 


3031 


16932 


A 


3053 


344 


2 


HVEAP I AGS I VIAAVLLKLGGYGI IRLT 
LILNPLTECHIAYPFIjVLSL*GIXITSSI 
CLRQTDLiKSLIAYSSISHIALWTAILI 
QTP*SFTGAVIIiIIAHGLTSSLLFCIiAH 

SN 


3032 


16933 


A 


3054 


266 


2 


FQPPISAYTKISPSLNVSLLLTLSILSI 
IAGS*GGLNQTQLRKILAYSSITHVG* I 
lAVLPYNPNITIXiNLTIYIILTTTAFLL 
LNLN 


3033 


16934 


A 


3055 


3 


32 


KYNSIiIMPTMIATITbLNLYFYbSPLLY 
**SSCPP 


3034 


16935 


A 


3056 


3 


33 


KNNSLIIPTIIATITLLNLYFYLSPLLY 
**SSSPPS 


3035 


16936 


A 


3057 


57 


445 


ANVWAPHGPAKLTNKDNYHIWKSKRLKI 
AWMTIKKLlSrEVIGLTLPDFKTYVELVQ* 
RQNAID*RKHKQPVKQSPEA*PHSYSQL 
I FHWGAKANHGRKD S LF YK* CW * ISFWT I S 
SQKLNLHTDLTNFTKIN 


3036 


16937 


A 


3058 


311 


1 


RVGIiLLKLNKISWPPPPPYGPS *EKEPL 
CFSQIGLFLTRTMVLNNLHSPPVKTRPY 
NKIAPFRELFFFFKDRVSICLPGWSAW 
* SQLTATS TSQAS SDPGRV 


3037 


16938 


A 


3059 


138 


411 


WERPWKAQEAVFWI *VSAFWAPPPLMEK 
QIPPDLEQHYRNVPGVNRNQPFVSFFLR 
WSLTVAQAGVWWRDLGSL* PLLPGPKRF 
L.CLSLLS 


3038 


16939 


A 


3060 


1 


189 


FCRVGQAGLKLLTSSDPPASAS * SAE I T 
GVSQRAWSKITILKSSSFSYFPNSCKMC 
FWLICLN 


3039 


16940 


A 


3061 


3 


406 


DAWADAWVLilLPRFGRTSHIVTYYSGKK 
EPFAYIGMV*AMISMGFLGFIV*AHHIF 
TVGIDWTRAYFTSATIIIAIPTGVKGP 
S * liATLHGSNMK* SAAVL * ALRF I PLLK 
KKKKRGAVLKVPWGGPSLPGCG 


3040 


16941 


A 


3062 


451 


82 


PPPTNYFSPPPAFLPGGGGPPRPPPPKK 
WSPPTPPPWIMPPPKKKKKFFPPPRGW 
GPPPKIF*KPPPFFFF*KKNPPFSPPGE 
NRGVFFSTKPPPPW6KKNFAAPGAPPPP 
PFPFFFPRGGG 


3041 


16942 


A 


3063 


430 


1 


FFPPKQLIFWGGGGPKSPPPKKKFFPKK 
PPGVFFSPPKKKKKFFFPPPLNIiAPPKI 
FFKSPPPLSFLFFPFFFFFFFFFFFFFF 
FFFFPLSVQTLLKRTRAPPQPPLLD*EK 
APAPRVP*TGEGMPAVNVAFAPPPFyKE 
RPS 


3042 


16943 


A 


3064 


1 


409 


PTRPRESTYQGHHTPPVQKGLRYGIILF 
ITSEVFFFAGPF * AFYHSSLAPTPQLGG
HWPPTGITPLNPLEVPLLTT 


3043 


16944 


A 


3065 


1 


137 


HTFNFSIYQKATVlKTVWYWYlSnsrRHTVE 
SPE INPYI YS * L I PFLF 


3044 


16945 


A 


3066 


4 


426 


KLEN * KMVLKE I KEDLNKQTD I LFS * LQ 
RIi3:TVRMSIIiPiCIiIYKFSAIPIQIPA*P 
L* I KI I I KCMRKGK*TRI AET I FFPFFL 
SQSFILSPKLDHRGGITAISrCTPPWAIKG 
KLIiLKKiCKKPPKTKPEKFFIQKATGAEG 
GVH 


3045 


16946 


A 


3067 


411 


187 


RNLPNVPPRPTHFVLLVKTGVSQVGQGG 
GKLLASKNPPSPAPPKSWDYRGEPPRPA 
PRKFFF*LNKFKIYGGPKN 


3046 


16947 


A 


3068 


193 


471 


QCTCIKVHSGQKTGSTPLVIGEIiQIKIT 
LGCYYTPTLMAQIKKTDRTKCW*GYGAI 
GML IliCWRECKI VQSL* KRVWQFI I * LN 
lYLAIKLNI 


3047 


16948 


A 


3069 


323 


478 


FFXFXFLFF F ILXYFYYF * KLFYLYLFX 
YIFKIYFIYNFIYIILFYIIFIIF 


3048 


16949 


A 


3070 


1 


378 


GTRRFFFHS I TLL I LGLLSNTLTI YQ * W 
RDVTRESTYQGHHTPPCQKGLLMGI ILP 
ITSEVFFFAGFF*AFYHSSIiAPTPQLGG 
HWPPTGITPLMPLSIPLLNTCVUiLASGV 
SIT*AHHSLIENNR 


3049 


16950 


A 


3071 


201 


2 


TTPIQIjFLKHYHT*NFNYNFFFEIGSCS 
lAQAKVQWCDLCLLQPQTLGLKHSSHLS 
LQSSWDYRHA 


3050 


16951 


A 


3072 


261 


1 


EKAMGGGPFKVKKS PGEGPTTKGWPLKG 
PLEGGQRGLTGPFKSNGRLFFFFFGSNE 
VSLCCPG*SRTPDFK*SACLSLPKCWDY 
RHG 


3051 


16952 


A 


3073 


333 


52 


E I FKKKKKGGGRFFNKRVFPRPRVSNGR 
PRAQFFLETFFFFPERGFFFFFFFETES 
YSVTQAEV* WNYLGSQQP PRFKRFSHLS 
LPSSWISTYRCK 


3052 


16953 


A 


3074 


7 


214 


SQLQENRLNPGGGGCGE PRSCHCTPVWA 
TE*DSVSKKKKKKKKRGGWVPPSLGGGP 
KKKNPFPNQEGGL 


3053 


16954 


A 


3075 


180 


419 


QKHLS IliHYLFKRDVFPFFIiKGSFWSQ 
VGGQGHNLG*LKAPPPRLTHFSCLTIiRE 
TWKNRPRYFFCPFIKTGFHHVTRE 


3054 


16955 


A 


3076 


371 


208 


QKLSGHGGSRL* S * PF*EAKAG* QLVSG 
GRGCSQrj*SHQCTPAWVTE*RLVCKR 


3055 


16956 


A 


3077 


349 


3 


TFCWQKYTMCRSLCHPHPPRTWSTKKKR 
PPFQKGQGDAPPYKKVQRGNPPPPLKGR 
PSRGPPKKCKVFKAlPVFLFPRFSPPPFF 
PPPLFFPFFF*DRVLPCCPGWSAVAQLT 

ATS 


3056 


16957 


A 


3078 


124 


1 


ISSETPAVKSSFWPGWVHACNPSTLGR 
*GKEITRSRDRDH 


3057 


16958 


A 


3079 


205 


1 


CL VQN I WACVSHWRY I iViS CKGC * RLGLM 
CVCVCVCVCVCVC VYFCCCCCCCYC* DE 
LSLCHSDCSAVA 


3058 


16959 


A 


3080 


3 


413 


GHHTQPVQKGLLYGIILFIT*EGFFFSG 
FF^AFYHSSLAPTPQLGGHWPPTGITPL 
NPIi*VPIiLISrTSVLljASGVSIT*AHHRLI 
* NNRNQ I I QALL I T I LLGL YFTLLQAS E 
YFETPFTISDGIYGSTFFVATGFHR 


3059 


16960 


A 


3081 


1 


321 


NSLNPGDGGCS * PRLHYCTPAWGTELDS
ISKBKKYPYHVYHPSMKMTIYNYRQQNV 

NHIYIMLVEHSQTQENTCFMISGNFFFN 

LPIVLGEGEKNQHS ISFKLFFNF 


3060 


16961 


A 


3082 


117 


484 


VLKYFIDSEVNAVLFSISCSFVTDFVFL 
FFFFGKGVSFCPPAGIKGGGFGFLSPLA 
SGFKRIFFPNPLEKWE*RARPPPRGKFW 
NFFFFPFLKRELFFAPRWEGRGKILVYL 
KGPPWGYPHF 


3061 


16962 


A 


3083 


381 


227 


CISRDGFYHLGQAGLELLTSSDPPAIiDS 
* S VGI TGVSHRTRPIiLLRLNVQF 


3062 


16963 


A 


3084 


2 


391 


SHAYHIG*PSR*PVTGALSDLLMTSGIiA 
M* IDFHS ITLLILCLLTNTLS I YQ*WRD 
VTRESTYHGHHTPPVPKGLRYGI ILFIT 
SEVFFFARFF*AFYHSSIiAPTPQIjRGHW 
PPTCITPRNTLDVPLLNT 


3063 


16964 


A 


3085 


2 


248 


IMRSGDRDHPG*HGETPSLLKYKRLAGH 
GGMRLWSQLLGKGGTADSHHHVXjLILET 
FYSLRERRHLTSVPTLGMNYWAQDIR 


3064 


16965 


A 


3086 


276 


243 


EKWPD^SRAACPVLCRGNGQYSKGRCLC 
FSGWKGTECDVPTTQCIDPQCGGRGICI 
MGSCACNSGYKGESCEEAPRYIPEKE 


3065 


16966 


A 


3087 


3 


130 


GFYHVGQAGLELLTL*SACLSLPKCWDY 

RREPPRPAHTPPHS 


3066 


16967 


A 


3088 


277 


2 


SSSSVFCDLVWTSSSSSSSAARIiPPLTG 
FLPKWAIIEEFTiONNSLIIPTIIATITIi 
LNLYF YLRLIYSTS ITLLP ISmJVKIK* 
QFEHTKP 


3067 


16968 


A 


3089 


404 


60 


FSHGKMRFFSPPSPKKIFFSPQSFYFLG
GGGAKMPPPKKRFFFKKTPRGFYFPP*K
KKNFFFPPPVKFGPPRGFLKSPPPFFFF
FPFFFFFFFFFFFSPTFFTVFHLMLKSD

ND 


3068 


16969 


A 


3090 


405 


186 


inkkpeaftntvx>qmvltnshrtfypta 
tacslsgahrtfsrmdhv*dhictsiinkf 
kte i tlstlsnhklep 


3069 


16970 


A 


3091 


371 


1 


spplvqkglrygi ILFITSEVFPFAGFF 

*AFYHSSIiAPTPQLGGHWPPTGITPINP 
LEVPLLNTSVLIiASGVSIT^AHHSIjIEN 


3070 


16971 


A 


3092 


263 


3 


NKRSPPVNLIWWKMGSFFKFAKRVKISWK
GGGGFFFFSWRRSFTFVAQAGMQWRNLS

SL,QPRLPDIjR*SACLGPPDC*DYRREPQ 
YP 


3071 


16972 


A 


3093 


1 


392 


fffadpkkmfilinhfkmelttypelkr 

NEATASENC*DAVKAVLGGKFIVIjSTYI 

rkeerprinnisfqikhwtkkn*t*gkq 
KKKKKKKKVTGPE I pkflivksgkppkv 
iltgawgpikflsfitrl 


3072 


16973 


A 


3094 


3 


367 


SM* IE I TMRYHYNTARRLKF * KTDNIKC 

* *ghgtsgtliyhwqe*kmvqplwkiv* 


3073 


16974 


A 


3095 


3 


257 


HEVSQDGLNLLTS*SARLGLPKCWDYRR
EPPCLAWLILPDDCVIFQKLKLLHHNLL
NDSCIDVLMGIYSLSNFSQSNFPFFFFF


3074 


16975 


A 


3096 


189 


414 


kglvmvts*ckmffclsifffffergfc 

FFAQAGVQGHNLSSLEPLPPQIiKQFPCIi 
TLPRS * KYRPAPPCPANFY 


3075 


16976 


A 


3097 


1 


299 


ENYRP I S FMNTDAKIIiNKIIiANQIQQCS 
KRITHRDQVGFLPGMQGQFYI *KSIKSI
NAIHHI SRLKKRNHMIPS I EAKKAFNTH 
S*FMKKNKKLAFQET 


3076 


16977 


A 


3098 


362 


126 


FIHTHTHTHTHTHTHIYI YI YI lASFQW 
NAIRGRMKCINKP*KDMEELKCILPSER 
SQSAKATYLLYDSNYITLEKAKL 


3077 


16978 


A 


3099 


3 


423 


RHEHAYHIVKPTP * PDTGALSALIiMTSG 
IiAM*FHFHSITIiLII*GLLTNTIiTIYQ*W 
RDVTRESTYQGRHTPPVQKGIiRYGI I LF 
ITSKVFFFAGFF*DFYHSSIiAPTPQLRG 
HWAPTGITPLNPLEVPLLYTCVLLASGV 


3078 


16979 


A 


3100 


392 


83 


LRNTNCGHGAPFKWSPFFPQPLVBCNrSPA 
YFQNKEKDVGKPPPFFRAPDGAPFFKKK 
KK*GIiALiSSRLEYGGMISLELLGSSDTF 
ASASRVARTTGQCHHAGPT 


3079 


16980 


A 


3101 


81 


247 


GGWGGPPLKPPLF*KKTGKNFWPPFFKG 
KEKPPPNPRGGGKKGGPKPPPVIFFFF 


3080 


16981 


A 


3102 


3 


388 


HEKKEPP*YIGMG*AMISIGPLGFIV*A 
HHIFTVGI YVDTRAYFTSAS 1 1 lAIPTG 
GKVFS *IiATLHGNNMK*SAAVL*ALGFI 
FI FTES GLTG I VLiSNS SLD I VLHDTCYL 
VAHLHYVLS IGAVFAI 


3081 


16982 


A 


3103 


3 


387 


HERHEELSNS *ANlSriiI *LAYTIAFIVKI 
P LYGLHL * LPKAEIVEAP I AGS I VLAAVL 
LKLGGYGIIRLTLILNPLTKHIAYPFLV 

LSL*GIIITSSICLRQTDLKSLIAYSSI
SHIALWTAILIQTP


3082 


16983 


A 


3104 


3 


381 


HEQSHAYHI VKPS P * PLTGALIAIiLMTS 
GLAMGF YFHF I TLL ILGLLT YTIiTI YQ * 
WRD VS RE S T YQGQHTPPVQKGLRYGI I L 
FIT*EAPFFARYF*AFYDFRLGPTPQLR 
GHWPPTGITPLNSL 


3083 


16984 


A 


3105 


412 


in 


FliGHPLFKKKMGGKKRGLPKMGV^HPPA 
P KGKP P PLiKKKKKKPGGGGAPL YS PFS G 
GEGKKNPLTPEGGGPKNPNSPPPPPPGG 

KKKNPP PliS PKKKKKK 


3084 


16985 


A 


3106 


298 


98 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFPFFFFFFFFFFPFFFFFFF 
F*L* IIY*AICAIWNIRGAGAELYRSRD 
FVCY 


3085 


16986 


A 


3107 


2 


417 


ATHVGLQDVTSPMIEQLITFHDHALMNI 
LI ICFLDLGALFLTIiATKLTKTNM* YAQ 
E I *TD * T ILPAI I LNIiMALPSLRMLYIT 
DEGDDPSLTIKSIGHQWY*TYEYTDyGG 
LIFNSYIRPPLFLEPGELRLLDVDNR 


3086 


16987 


A 


3108 


8 


430 


VGIjVIiFLSBLPLNGGIIjTFFHQGIYSPF 
PGGRTWAXiMVGSWGLVMASTDLLGPLCH 
AFT'PATQLriLNriAVASPL*PAALRrGCH 
SKTTNAIiTHPLPRGTPTPPRP ILVTIET 
MSUlilPPIAPDQRIAAGFTARHLLLHLS 
V 


3087 


16988 


A 


3109 


3 


453 


PRAIKFYRDWPGHERKRIAWKGDPCHMV 
LIKDEKGLMCQKKKKKTPFFWAPKIPLV 
FPPAQKNQGS*PNPPGGC3GNPPLRPGP* 
RKKPPALMGPPPSSPGGINPKSFFYIiNP 
GPAH* PRGEN*LGPVFENLFLPLLKKIP 
LWGELRCSQT 


3088 


16989 


A 


3110 


398 


2 


SIjFQKNPNPLVG*KKKGKALPQGPPPLF 
PPLGGAGPGGSQGRGWGPPRPPWGNPFF 
F*NPKIWPAKGGAPFSPPSEKG*AGGFF 
LPRGGGFPIiTQIPPPPPHLGGKSKPPFQ 
KKXECKNRQGTSWMMQGWGPQ 


3089 


16990 


A 


3111 


3 


371 


S SDPPTSAFQGSGTTDMCHHHHAQLI FN 
FFVETGSCYVAQAGVQWHDHGSLLSQTP 
GLR*SSGATVLNWAPALGPRRPDPTRME 
SLVLMKPRGSLIRSACPDCFLVFWFSFF 
HEAEGCASEC 


3090 


16991 


A 


3112 


233 


449 


FALFS*IiATLHGSNMK*SAAGL*ALGFI 
FLFTAGGLTGIVLANSSLDIGLHDTYYV 
EAHFHYGLS IGAEFAI IGGFIH* FALFS 
GYTLDQTYPKIHFTI IFIGGNITFLPQH 
PLGLSGMPRRYSDYPDAYTT*NILSSGG 
SFUIAIiNAGSIT 


3091 


16992 


A 


3113 


458 


2 


RGPPPPPPPKFFFF*TPGKNSPPPPPEG 
VFFPPS PPPNFFFS PPPLPFFGGVPP I S 
PPPPKSFSPQTPPRFFFPPPPLKKNFFS 
PPPLFLPPPPFFFYPPPPFFFFFFFFFF 
FFFFFFFFFFCGDLEGELPGTGMIiACVI 
LLRANRKARTRG 


3092 


16993 


A 


3114 


1 


418 


LNTTFFYPDGGGDPNIiYQHLY* F*GHPD 
PDIHILPRPGIRSHIDTDYSGKKEPYAY 
VGMGWAMTS IGFLWLMVRARPLFTVGVG 
VDAQAYS S FAS IT lAI PTGAEVFS *LCP 
LPLSGMK* TGAAVWALGLRF I FTCSGR 


3093 


16994 


A 


3115 


1 


425 


PRINNISF^-IiLPPSLLLLIAYAIVEAGA 
GTG*TAYPPIiAGNYSHPGASGNLTIFSL 
HIiAGGS S ILGAINF I TTI INI KP PAITQ 
DQTPLFV*SVLITEDLLFLSIiPGIiAAGI 
T I IiLTDLiNL STTF FD P AGGGDP I LYQHL 
F*FFDPAGGGDPILYQHLF 


3094 


16995 


A 


3116 


2 


383 


GLSCTNHKDMGALYLLLGARAGVLSTAL 
SLLIRAELGQPGYIiLGNDHIYNDIVTAH 
AFVI I FF I VIPTVLGGFGN* LGPLIIGA 
PDTAVPRINNIS I * LLPPCLLLRLACAI 
EE AGAGTG * TVYP PL 


3095 


16996 


A 


3117 


1 


259 


PTRPAIiWTAILIQTP*SFTGAVirjIIA 
HGLTS SLLFCLANSNYERTHSR I II LSQ 
GLQTLLPLIAF**IiLASLANLPAPTPTP 
HQ 


3096 


16997 


A 


3118 


3 


174 


LIRGGRGCHELRSRHCTPAWATRARTLS 
QKK*KTTNPKKKLCLIFFGGKKKKLKKG 


3097 


16998 


A 


3119 


155 


1 


PDFFNKSMDKKKTARGWEDSSSSFCFFK 
RDRVLLCCPGWSAVAQS*LTAAS 


3098 


16999 


A 


3120 


420 


2 


PPPKFFFPPTPPFIiGEGGAKTPPPKKIF 
FLKKPPGVFFPPPIiKKKKFFFSPP * FLA 
PPKIFFKRPPPFFFFFFFFFFFFFFFFF 
FFFFFFFFFI.VFIVLFQVKVHFLKKCFK 
IQFPLASDNS*PSMIHEKFYCESNIEF 


3099 


17000 


A 


3121 


48 


387 


RDPVLQKKEKKKKKKPKNQKKKKKKGGP 
F* KIP *GAKI KPGKEKKNFS PKRGAKKK 
NPGNFEKKTNFGGGKKWGKPPPKN*RFK 
GKKKFLKGKGGKKTQIPWGLKIFFNGFD 


3100 


17001 


A 


3122 


237 


2 


PFPWIiPPFPLKASSSP*SL*FIiLGGWP 
IsrLPSSP]SrKGSFPKFPSWFIjFRPP*GKNF 
YLALPR*PWPPQ6FL*TAPP 


3101 


17002 


A 


3123 


448 


130 


PPRFFPEFFYSGPPKPPPPKTPVFLGVK 
PGVFFFSPYQKKPTNFGPKMGAIi*RIPL 
FETPIWVFPIKGFHKKKPPVLN*PPTRK 
PPPDKILiKKKKKDDCISRLARN 


3102 


17003 


A 


3124 


453 


3 


SSAREGGGVPPPPPKIIFFPPPPIFSWG 
GGGTKSPPPEREVFPKNHGGCFFSPP*K 
RGKYFSPPPRMGPPPGVFFKGPPPNFFF 
FFFPFFFFPFI FFPFFFFFFFTKKKNWF 
FFKAFRMSPKPVPPFFFFCNYRWFRPR 
VRPRVRPRV 


3103 


17004 


A 


3125 


443 


2 


YFPPFGRVI WGDSLGAGVIiNPPRPHKGT 
PFFPKXIFS IRPGWGGGPPIiPPPQRQMW 
GGPPPPLFGLFRPQGKIPFPPKKKIKPP 
GGVCVGVGVCV* KKKKITPPPPFLWAGLK 
EPCFFFFFPFEMBFCSCCPGWSAMARSQ 


3104 


17005 


A 


3126 


3 


186 


PVIYSTIFAGTLITALSSH* FFT*VGIjE 
INMLKKKKKK^KKKKKKKKKKKIOCKKKR 
GGGIi 


3105 


17006 


A 


3127 


171 


38 


KKKKLFFPPREKWGPPKNFLKRAPPPFF 
FFFFFFFFFFFFFWSERSS * VA 


3106 


17007 


A 


3128 


16 


189 


ILGEVIWV* *FF* FIKKKKKKKKKKK3CK 
KKKKKRGAP FKKTPGGPQ I T PGEKKKI F 
Sli 


3107 


17008 


A 


3129 


401 


85 


LVNFFSPQEKRGFFPPPPPKNFFFSPRG 
FPFLGGGGPIFPPPKKSFFSKNPPGVFF 
SPP * KKKIFFFP P PLFWAPPRFFFKGPP 
PFFFFFFFPFFFFFFFFFSQF 


3108 


17009 


A 


3130 


2 


312 


ANNL I * LAYTIAFI VKI PLYGLHL* LPK 
AHVEAPIAGSIVIiAAVLLKLGGYGI IRL 
TLILWPLTKHIAYPFLVLSL*GGGVF 


3109 


17010 


A 


3131 


449 


3 


ffffffflggppeifffffcppkkpkpp 
lgggkkkppff^nfpqkplgvlgapppp 
lcffikkkkgggkkffspplf*kggpfk ' 
kffffppppkgpppflknlrgwvffkpp 

PKKKALSFFKKKKKKKKKKKKKKRAAAR 
DLELADAW 


3110 


17011 


A 


3132 


95 


448 


V INRE * KC VC * MKKKKKKKKKKKKKKKK 
KKKERAGKGGG 


3111 


17012 


A 


3133 


60 


442 


LGGFFFFGGKKGFCLWCPRWGAKAGI PV 
NGTPPRGV*RNFIiAQPP*EGGITGPPPL 
PQ* FGFIiRENGVPLRGPGGFEPPILGEP 

PPLPPQKGGKNGRNPPPPLKGFLVLFKK 
EFSSLVPSWKARGDP 


3112 


17013 


A 


3134 


236 


45 


ffpffffffffffffffffpffffffff 

FFPFFPFFFFFFFPFFFFFPFFFFFFFF 

ffffffffffpffffgaad*vii.pryfq 
peatimgv 


3113 


17014 


A 


3135 


441 


88 


KQQTPPGDI FF* KAPRRE I PLPPPIMVF 
FSPPSPPKFFFFLSPFIFFGGVLPFFPP 
PKKGFFPKWPRRVPFCPPLKKKNFFFLP 
PFFFGPPRVPFYPPPPIFFFFFFFFFFF 
FFFFFF 


3114 


17015 


A 


3136 


2 


328 


TMIiSPKPQQLNQQNCSPEHYEPQLKTQR 
TWR* KKXKKKKICKKIQCKKKKKKaCKKKKK 
KKKKKRGGGQKKKMVGGEKKKPG*KIFP 
FFIKKVKKKTALGDKKKTQFWGGS 


3115 


17016 


A 


3137 


281 


2 


KATKSGTPIPSQGQQSLAWSWAGIGSAQ 
PPALLHS * P IGKI FKNCMP VGRKSPQLP 
RNTSWQLGAVAHPSNSSTLGGRGGRITR 
SGVRDRTRG 


3116 


17017 


A 


3138 


448 


3 


FFFLPPSPFFPY*KRGFSLFGRVVFNFP 
PPVFPPPWPSQIFGFQALIFLPPPPPLF 
FSFPVGFFQTALFPFNGPFPGFFHGLFP 
riLRFCPRK*VFWGGFFFPFFFFI,RDKVS 
LCHPGWNAVAQSEFTTALTSKAQAPTRP 
PTRPPTRP 


3117 


17018 


A 


3139 


2 


436 


DR*LFSTOHKE IGTLYLLFGA* AGVLST 
AIjSLL I LAELGQPGNLjLGNDHI Y3WI VT 
AHALGKIFFIAI P 1 1 IGGFGN*LAPLI I 
GAPEMAI PRINNIS *GLLPPS ILLLIjAS 
AIEEAGAGTG*TDYPPLAGNYCHPGASG 
DLIIF 


3118 


17019 


A 


3140 


362 


2 


KPRRGKFFPPREGGGGFPPPPPKNFFFP 
KGGKFLGGGGGKNS PPQKKGFFQKNPGG 
VFFPPPKKKKNIFFPPGKMGAPPGFF*R 
GPPPFFFFFFFFFFFFFFFFFGQSGQVK 
LKSPKCKL 


3119 


17020 


A 


3141 


440 


102 


PTPPPGCKFSFKRPPKKTLFFPTTNLVF 
FSPIPP*NFFFSPQAi:jIFVGFLAPrFPP 
QKKFFFSKFPPLFFITPPLIKKFFVPPP 
PFILSPLKIFYKPPPPIFFFFFFFFFSF 


3120 


17021 


A 


3142 


1 


79 


FKLDYFSIIFIPVALFVTWSIIEFSL*Y 
INSDPNINQF*KLDYFSIIFIPVALFVT 

WSIIEFSn 


3121 


17022 


A 


3143 


3 


441 


FFFPPPLKKKKFFPPPPNIGPPPKSL*K 

PPPPPFFFFF 


3122 


17023 


A 


3144 


223 


3 


LPYWKLiPYbKH***LQDTNQESRG*HFL 
RPRP FKNQMKSGTVAHACNPS TLGGRGG 
RITRSGV*DHPGQHGSI 


3123 


17024 


A 


3145 


65 


414 


KKKKKKKKKKKKKKKKKKKKKKKKRGGP 
PKKKTRGGPQNPPPKKKKFPPQKGGKKK 
PPLGF*KKTPPLGGEKIPPPPPKKNTPP 
KKKKKF*GGGGQTPPPPPPGKKFSPPRN 

KKKK 


3124 


17025 


A 


3146 


410 


31 


RANQKAPRGKPLCDLAVGKNLSSRTQIA 
IiTI*KWINWTIIiKLRTSGH*KTPIKTIK 
RYPIEGEKISDEELYLQY*KVIi* TGKKK 

PDNPVEKWANDLKRSFMKGNILTVFKGM 
QR*LGSMAHICSLS 


3125 


17026 


A 


3147 


182 


241 


SHPSHHSTINITNKGLL*TPLPIPMPLV 
NLNLGL L F I LAT S S liAVYS I L * S GGASN 
SNYALIGATiRAVAQTIS YEVTLAI ILLS 
TLLIRGSPNL 


3126 


17027 


A 


3148 


254 


1 


KTKKGLKIKDPLTRF* ISVC* ITKSI*F 
FKLLPFFPLLKEHIPLKYLFFFFFLFET 
EFRSCCPGWST]yiV*SLLTATSTSQIQAI 


3127 


17028 


A 


3149 


499 


2 


NTPPAAGGGCFFFFFFGEKNFPPPHPTP 
RPFPPPPLKNFPFFSRVFFFWGGGAQKA 
PPPKKVPF*KIPRGFPPPPPKKKKIFFF 
SPGFFGAPPGFFLSGPPSFFFFFFFFFF 
FFFFFFFFFFFFFFFFLINFLLSLQGFF 
LVS KE LFLFGLTVKFTRGFRGFCGQ 


3128 


17029 


A 


3150 


256 


146 


KNAKVTQVGPEFNKGPG*HTHTHTHSHT 
HTHTHTLQ 


3129 


17030 


A 


3151 


333 


1 


T I SCLCTRGEHPLS PRRAGPYTGS PLHC 
CVDWDKVFS SWKDIiTDWPLGDLD IE YF 
TDGS S F I LRGVCRAGYAAVTLDSAVE VL 
SVSAETSA*KAELIALTRALWLGKEQK 


3130 


17031 


A 


3152 


43 


379 


NKTSLFGWDYIWEWGAPEPETPPKRAAG 
AIMSLAQPFSVALPPCFFDPCAPSLSPG 
*HAIiRPLPIjPGLASEIQTAPSWHVPPKS 
LSPAPQPCPIPTLVPVGYKTPP 


3131 


17032 


A 


3153 


398 


12 


NTTPGGGKFFLKKTREEKFFPPKKKRGF 
FPPPPPKNFFFPQGGNFFGGGGGQISPP 
QKKGFFQKNPRGVFFTPPKKKKIFFSPP 
GKMGAPPGFF*RGPPPFFFFFFFFFFFP 
FFFFFFLKKSWRPLAI 


3132 


17033 


A 


3154 


371 


105 


S PS PQ WFI KGPKP P PPK* I L * RAPNP P 
IiPKKKPSNPPTWGPGPQPPPSKFSKFAR 
FPFLPPPFPFKKEPPKKKIFFPTKEGTV 
I*KNPPFSGFQSPDSIK 


3133 


17034 


A 


3155 


3 


371 


DVGADPILHTSTGIi*LAMQY* PEA*TAF 
S S I AH IT* D VY YG * V I RYLHANG A* IFF 
ICLLLHIGRGLYYRPFLYSKT*KIGLIL 
LLATITTAFIGYVLP*GPI*F*GATVMT 
NLLSAIPYIGT 


3134 


17035 


A 


3156 


1 


398 


lAIPTGVKVFS *LATLHGSNMK*SAAVL 
*ALRYIFIiFTEGGIjTGIVLPNSSLNIVL 
HDPYYWAHFHYVLSIGAGFAI IGGFIH 
* FPLFSGYTLDQT YAI JHFTI F I GVNLT 
FFPQHFLGLSGMPRRYSDYP 


3135 


17036 


A 


3157 


401 


46 


PPGGRNFFKKTPGKKIFSTKKKKGFFPP 
liPPKNFFFSPGGFFFGGGGGPNFPPPKK 
GFFSICNPRGVFFSPP*KKKIFFFPPG*I 
WAPPRVFLKGPPPFFFFFFFFFFFFFFG 
KKSVRA 


3136 


17037 


A 


3158 


404 


18 


FFSPPPPPPGGGVFPPNPNKNPFPPPPP 
PFFLGGGPPPPPPPPP*FPPPPTPPNVF 
FFSPPKKKNFFPPPPGPPPPPPKPPPPP 
PPPFFFFFFFFFFFFFFFFFFFLFFFEP 
MEKGPRPGDIGSNPPS 


3137 


17038 


A 


3159 


3 


192 


SLVIAGCPR*NLSSTIiNLPTEPSKSPCK 
PWGGKK 


3138 


17039 


A 


3160 


205 


80 


VQRDNFGFIiQPSPSGVKLFFCLSLPNKW 
DFRCGPPNPG*FFS 


3139 


17040 


A 


3161 


3 


384 


III VPTI ILrjPLT*I,SKKHI 1*1 INTTTHS 

I.IISIIPLLPFNQIWNNLFSCSPTFSSD 

HLSHPILKKKKKKKKKKKKKKKKKGGGA 

FKRTPGGAAHWGGGGRETFFPKGGEKKN 

RPGVFWKQTFFWGGK 


3140 


17041 


A 


3162 


348 


70 


GPPPKKRVFSKTPKWLNKPPQKKKKFI 
FPPPWLGPPKNFLK6PPPFFFFFFFFF 
FFFFFYFFWMGCDR* CSWRHSSPPRLSG 
TPKCSPSVT 


3141 


17042 


A 


3163 


2 


353 


LKTIPLTSTCLTIGSLALAGIPFLTGFY 
SEDHI I * TANVLYTNA* ALiS ITLIATSL 
TSAYSTRI ILLTLTGQPRFPTLTNIETK 
KKKGGPFNRYPLGAQVYGGGQNEKFFLI 
GREII 


3142 


17043 


A 


3164 


1 


221 


PTRPRDCSELRSCHCTPAWATEGDSISR 
KKKKLSTRTAF*YTEAINSLIYSLNIGT 
FKTYFPKIKTYDRHFDF 


3143 


17044 


A 


3165 


26 


383 


IPFYQ*SLI* YTRKKKKKKKKKKKKKKK 
KKKKRGG 

Amino acid sequence (A=Alanine,
C=Cysteine, D=Aspartic Acid,
E=Glutamic Acid, F=Phenylalanine,
G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine,
N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop codon, /=possible
nucleotide deletion, \=possible
nucleotide insertion)


3144 


17045 


A 


3166 


251 


381 


GNIiCAGWARWLTPVVPALWEAEAD*SRG 
QEIGAILiANTVKPHL 


3145 


17046 


A 


3167 


391 


115 


LiFKICISPHAGIWGFFSPLTP*NFFFSLE 
PPIFGRGriAPIFPPPK*RFLSKNPPWF 
IPPPLMGKPFPPPPPVRLGPPIYSFKGA 
PPFFFFFF 


3146 


17047 


A 


3168 


94 


389 


SPGILGQKGQIGPIGNHVPGGLAAPVTP 
* FRFKPRLP * GFGPKAS P PLALKPERAQ 
VGGTPPPGPRGPNGQPPPFKENQPGIiGF 
RFRGKIiAEKRGFHIi 


3147 


17048 


A 


3169 


3 


363 


WATALQPGQQSETQSQKKKKKKKKKKRG 
GPPFFFFFKKKIFFSPPGAKNKRGKFF* 
KPPKKKKKIFFFPPPPLKKKKKKKKKKK 
KKKKKKMGGAFLKKPRGAPPPPGKRKLI 
FFFLKGV 


3148 


17049 


A 


3170 


2 


171 


KEPLGYIRMV*AMISIGFLGGIV*AHHI 
FTVG I DVDTRAYFTS ATI 1 1 VI PTGVKV 
FS*LATLHGSNMK* SAAVL*ALGFIFLF 
TVGGLTGIVLTNSSLDIVLHDTYYWAH 
FHYVLLIGG^FYIiRYHNHCYPHRRQS I 


3149 


17050 


A 


3171 


390 


27 


QSLTVKSPYPWILIKTKGHH*VMNAGL 
TRYQSLLCENPHIRSEVCITLNPPPLLP 
VSE S P VKHS C VQVLDS VYS S GPNL* DHP 
* TS VDWEL YVDGI S FANPCKVSIiKKMTS 
PAPVTPRS 


3150 


17051 


A 


3172 


127 


310 


KNPGGAKILRGGERKNFFLKRGGKKKHI) 
GIFGKKTFPWGGKKWAKPPK*WRFFSPF 
SP*KFFFSLKALIFFGGFCPFFSPPKKS 
FFSKNSQVFFFSPPFKEKIFPFPPP*NF 
GPPRVFLKGPPPFFFLGVFFFFFFF 


3151 


17052 


A 


3173 


376 


3 


FFFFFIiRRQSLAVTLAGV*RCNLSSLQS 
PPPGIK 


3152 


17053 


A 


3174 


365 


14 


RENFFSPSGGAPKNKPPPPPPPPGGKKK 
I FFQKKKKKKI F YP WKNF P P P P KKKKKN 
P * KPGPLKSQGFFFFKKNQNLPWGPPQ I 
SFPKKKKKKKAKTVQERKYNSNTQLVSA 
ETQLL 


3153 


17054 


A 


3175 


1 


377 


VPLHSSLGGKARLHLRKKKKKNPGFLKN 
FGPIiALLGMGVGNI KGLKGQKGKNPAFG 
AHTGGGCFSLRGRNTPFPKRAEGIICYN 

S PH* KET* KALEPRGYKGL * QAIiALPNL 
KSGKMEHILRGAP 


3154 


17055 


A 


3176 


352 


125 


GHEVLDSSDLPASASQSAGITGVSHHAL 
* EILLSMFETTWACDLLFQNI S F I KSS I 
PCFIGLDFIMPHRYCRFFF 


3155 


17056 


A 


3177 


139 


366 


TAHTS *GY* VKNYINLiSFCFFFFFLERW 
HS ALQPGGQGHNRS SLQPWFPGLKQFS C 
LSLPRSWDNGLVPQHLVNFP 


3156 


17057 


A 


3178 


206 


1 


KGTLFKKDPFFKNFPKKVFLGVPKNSLF 
*KILTLPPVLNPYPPFFFFPFETVSLCH 
PGWNAWPRLEP 


3157 


17058 


A 


3179 


330 


1 


IVSTLETCYIAYWEEEKDTFITLRIYVI 
GGNGKFLGI* IKQHIKKIIHHDQVGFIP 
GMQGWFNICRSINVIHHISRIKGKSRAQ 
WLTPVI PALWEGDAGGSPEVRSSRPA 


3158 


17059 


A 


3180 


3 


399 


HASAPLQSSLGYRARPCFKKKKKKKKGG 
GGGGFFFPKGGFGPLPKKGFFSGKGGLG 
KWGLGGAGKTPG I KKP LGKGP P KKRGGK 
KGQKGPHF*KKKKKKGGKKLKKGPF*KP 
FKKPLLGRGVQP *NPHLLGG 


3159 


17060 


A 


3181 


2 


204 


CPTACPFW*NKELLMPKKKKKKKKKKKK 
KKKKKKKKKRGGAPFKNSLGGPHFPGAG 
KKNFFFFWGGY 


3160 


17061 


A 


3182 


71 


377 


PKRGGQPKQKKIWGPPPPGAPPQKGMGF 
FNPRGFKKQ ITFFPPPPPPPEKKPPPF* 
KKKKKKKK3NCSD 


3161 


17062 


A 


3183 


151 


2 


FFFLEELVPLLLKIjFQKIEEEGCSPNSF 
NEAFI ILIPKPFR* TTKKENF 


3162 


17063 


A 


3184 


152 


387 


YFQGFIYWACICTSFLFFFFFLKREFC 
FVTQVEVQGPNFT* LNPPLFGIiKKFFCL 
TLQIGWNNRPLPPPQVIFCFFK 


3163 


17064 


A 


3185 


22 


336 


YEKCTAIiLQMVS S FIWMEREGTHQYS FY 
RKDFSLASKVNI VS YYLS P I VE* FFFFL 
RGS PFAPQAGGQGPNLGSWKPIiPPGLMP 
FSCLTLPGGWNCRPPPPGPVN 


3164 


17065 


A 


3186 


344 


1 


WVLKKI FFYPGRGGPPF I PP PLGGQGGP 
I P WARGF * P PRGP P PKNGF * KKKKKRGG 
GGPPPGPPPPGGPRGGVPFFLGGGGPRK 
PKKITKKKNPGEKKKTSFKNQKRKTKIK 
TT 


3165 


17066 


A 


3187 


296 


1 


NPKKILTLPKKTKVYKCEGENQVPIIFQ 
GIKNIFWKGIF*PKKEREVCV*SMRHVI 
P VFPKKRGS KRSNKS CC YKDTCTRMF I V 
ALFTIGKTWKQPKY 


3166 


17067 


A 


3188 


2134 


1 


GVAAHACNAS ILGGQGGRI I *GWEFETS 
LANMVKPC 


3167 


17068 


A 


3189 


1 


159 


LQDHPG*HGEPPSIVKIQKLARHGSLRL 
* S * LLGRLRQRMRQETCLNPGAR 


3168 


17069 


A 


3190 


119 


340 


QIKKNRL VS ARGKNNKRK* I YKP* VD I F 
FKEDIQMAGKQMKRDLISLIIREIQIKT 

TI TCYL IHARMGT I TRD 


3169 


17070 


A 


3191 


75 


1 


LSVJTOTFWPGTVAHACNPSTLGG^GG 


3170 


17071 


A 


3192 


343 


1 


IFILGGGPCCSPVFFQFFGGGGGGFFLP 
QNFFPPRGKIFRPFFF*KKKLKRPNWGF 
F*NFFNPPLGFFNLFFFFFKKKPKNFFF 
LGGFPFFFPFPFFFFIiAGGDSLALSPRL 
EC 


3171 


17072 


A 


3193 


105 


368 


KFKDPPFPPPFFWPPPKQLPPPPGKIGA 
PHF*TPKGPPPPZKKKC*KKKILKGGRG 
KKKKKTPPKRPQKIWGPSKKKNPWGGGK 
TPPL 


3172 


17073 


A 


3194 


1 


365 


FCRDE VIiDFCPDWS PTPGLKQFSHLCLP 
KCWS YRC* PPYPAQGVFLK*HLTKSLS C 
LKLFMASLCLQDKVPAPQPCVKGLSKFF 
LCHLLSTLIPFTHSLFFLFLGWFLRQHR 
SVTQAGVQWR 


3173 


17074 


A 


3195 


1 


362 


GNQPKRIiNAGTYLLFYTIjEGSVPLLIAL 
MYTHKTLWSLNILIiLTLTGQEr.SNC*AN 
NDX* LAYTRAL I VMI PLYGLHL* LPKAH 
VEAP I AGS r * LAAVLLKLGGYGI IRLTL 
ILNPIiTNT 


3174 


17075 


A 


3196 


137 


3 


KGQPRFPPISLKKGSQGKKGFFFFFF*Y 
RIPLCHPGWCPWQSW 


3175 


17076 


A 


3197 


1 


266 


egrgcsel*schctpa*vtvrypvskkk 
kkkkkkgggkgkkkggekntlfgpkkgk 
lirgpqkrgkkigpekkvgnnlkkgi ffr 
EKTF 


3176 


17077 


A 


3198 


217 


446 


YWSEHYAMTQVLEGFSYSLQDHFYFCFR 
S IRRI I F YSLI KPS IND * GERELEP I TT 
SQALQIAGRAGRFSSRFKEG 


3177 


17078 


A 


3199 


345 


314 


QPGPEGKIRFFLKI PNLTPSGGKSLKFP 
LFKRVKPENCLSLRG*GCN* P I 


3178 


17079 


A 


3200 


2 


330 


SRHYTPAW*QSKTLSPKKKKKKKKKKKK 
PPFPKNTQKKPKKKKRGGLITWGVKTPPP 
LKTQKMGISPGKKriKKWLFPLKKKKPEG 
NPLF*FRGF*KKKKAPFKKKKNPPK 


3179 


17080 


A 


3201 


2 


385 


FPPFIiGGPQIPKFIiKFFLRAN*NLFGLF 
FLGGVLKKIFLRKPFSLWIKPPPTPFKG 
KKIFFKTFFQKPLFFF*KKKFPFCPPPF 
FFLSRGFFVFFSPKKPFFFFFFFFFFL 


3180 


17081 


A 


3202 


355 


2 


FFFFFSETESCSVAQAGVQWHYIiGSG*A 
LPPRFTP 


3181 


17082 


A 


3203 


156 


1 


LKSLLWEAKVGGLLEVRS SRPA* ATWRD 
FISTKN*KISQVWWYILVWTTWE 


3182 


17083 


A 


3204 


352 


175 


QPRGRPAPAHPP * CPLRIALPC* CPCPA 
CCPPWAEASPSGVQASPARAPACPARAIi 

liNE 


3183 


17084 


A 


3205 


280 


373 


QRGTRIFSDLQT*KKKNKSPFKILLLID 
NALGHSRVLTERYKDIFRPANTTSFLRP 
MVQELIPTFQS 


3184 


17085 


A 


3206 


1 


367 


EIESIQIDGHTKNKFLGIHLTYLjTKEVK 
DLCKKNYKTLLKEI IDDTEKYDMIjMD*N 
NIVKMAILPKAIYRFDTISTKLLMSFFM 
ELEKIF*NLY*KA*MAETTISKKNKAGG 
ITLLDFKLiYY 


3185 


17086 


A 


3207 


368 


12 


FAQKKKKKKKKKKKKKKKKKKRAPPQN* 
RAPQKPIiKPPPRVFLIPPPLGSPPPPAF 
FWRGGGPPPGPFSKKKKKMRLTGGGRLF 
LIiA 


3186 


17087 


A 


3208 


2 


356 


KYLFSS IPEGKEKMKGIANLFNEIISEM 
CPSIiARDLD IQKTRHANP YNLKKSSPQH 
IIVKLSKVKDKERILKTARKECFITYKR 

TP IRPGMVAYACNP S TLGG * GGW IMRSG 
DQDHPG 


3187 


17088 


A 


3209 


359 


170 


INIKQIiPRCGGTCIiWSQLSRRLRRQDCY 
SI,GGQGCSEL*SSQCSPAWATERDSVSK 
YKINNFL 


3188 


17089 


A 


3210 


295 


122 


VIKTV^NWNKDRHIDQCSRIESPKIHLH 
IDDQL I FDKDVKT PE *R3SINGLFNKWCWE 
N 


3189 


17090 


A 


3211 


218 


466 


ATFDTTIjPSCYQNAPNNRFDDLSD*EQE 
IDTMTVWIILPLRSI.NIVITNPYNI*HH 
QHDLDYTYPDTTGQLVNCAYFINLI.QP 


3190 


17091 


A 


3212 


340 


355 


LGLQG*REGNFGSIiQTSPPGFKRFSGIiS 
IjLSSWDYRV 


3191 


17092 


A 


3213 


254 


40 


WPGTiyiAHACNLSTLGG*GGCITRPGDRD 
HPG^HNIJNN 


3192 


17093 


A 


3214 


198 


25 


YlSrEKEKPVKMLKI lANSLGAVAHAYNPS 
TLGGQGGRIMKSGDRDHPG*HADAiraST 
VL 


3193 


17094 


A 


3215 


376 


3 


PPGVF*RGPPGFRTIiPPKSSSSSPPPKI 
LGGPPFLAFFGGAPPQKKPPSSSSSASS 
P PSS S SRAPLKKGGP FNPAGGFPLFGGP 
PRGGPPFSSSSSSSSSSSSSSSSGRRSR 
SRTSRTRGRTRG 


3194 


17095 


A 


3216 


393 


260 


VSQDGLDLLTS * STRLGLPKCWDYRREP 
PRPACTIiCISSTYNDP 


3195 


17096 


A 


3217 


3 


387 


HASASSGSRAPPFFFFFNKKTGSKELET 
TRGTVLKTGLFE'KKKKKVLGGPKPPIiGN 
PKGVGKPLKGFPGPNPWPPP*KGPGCHF 
*KNGF*DPGPKKSHIPSWVGGKGSSLLP 
GPLGGPSEVYFSPLRA 


3196 


17097 


A 


3218 


77 


406 


RMARPELGL.PGNLLSNDHIYNGIVTAHA 
FVIIFLIVIPIINGGFGN*LIPLIIGAP 
DMAFPRINNI S F * LLPPSLLLLIASAI V 
EAGAGTG* TVYPPIAGNYSHPGAS VK 


3197 


17098 


A 


3219 


394 


2 


KRRYFPDGLNFFWGPGILKIFVKKKVSS 
LKKKKKKNFPPVFLQGWGGNKNFKGGGL 
KFSKPNLI *FFFPQKEAR*KVFFFFFRS 
PQKDPLREFFFFFLRQVSLCHPDWSAVA 
RSQLTSAS ISRAQGAGRV 


3198 


17099 


A 


3220 


376 


157 


KFFFS P * KFFFS PKPIjKPGGGVGP II P P 
PKKRFFFKNPQGVFKKPPQKKKKIPFQP 
PVNFGPPRDFLKGPPPFFFFFFFF 


3199 


17100 


A 


3221 


394 


1 


VPPPQKFKTPGPPPPPREFFFF*KKKGF 
PPLGGFLNPAPKNPPPGPPKKVGFPGGP 
PPPPGGFFFSPPLSF^NPGERVFFGPKI 
PKRKFLLKWGKGGKFFPKKPIFPPPPKK 
KKKKKKRAT^ARDLELADAW 


3200


17101 


A 


3222 


207 


402 


SILM*LCCLFPLPGVTPIDGAPHRSYRE 
CYPVLLDGVMVGWVDKDLAPGIADSLRH 
FKVMREKRI 


3201 


17102 


A 


3223 


309 


3 


YPFFHLIDXAIHPCVCFTKFYKATVTQT 
AWSWYQIRYIDQ*NGTEISEIPPHIYNH 
VICDKHDKNKQWGKDSIjFNKWCWEN*LA 
ICRKLKLDPFPTPYAKI 


3202 


17103 


A 


3224 


3 


382 


LDRERPP FFFWGARHMDI PQLWLS INK 
GHWANFNFLiGYKKKGGWEKKKKKKKKKK 
RGGPP * KKPPGGPK* PPGGKR* I FPYMG 
GKKKPPGGFLEKPPPLGGAHLGNPPPQK 
YTPPGKKKNLKRET 


3203 


17104 


A 


3225 


133 


2 


FFFETESHSVTRLECSGTISAYCNLCLP 
GSSDSPASAS*AAGI 


3204 


17105 


A 


3226 


349 


2 


AGVPPGWPPLWGGEGGGSPRGGGLKPGP 
PQRGNPFFFKKSQPTPPGGGPPttlPPPW 
GGGAGGSPLPQGQRFQ*TKIGPPPSPRG 
KKKKPPFPKKKKKKKKRKEKSEMPGFMTV 
LNA 


3205 


17106 


A 


3227 


3 


239 


IiNKVGRGCSEPRSRHCTPAWATE*DSIS 
EKKKKKPESRGIIiKVKGLTQALFYIiALW 
LYYLHPTAKQQIWGFFXYFSKP 


3206 


17107 


A 


3228 


205 


1 


IGLKIQNSCPLKDSLKKIKRQATDWRKY 
LQNTSDKASVFIIYKEHLQIiSlyrYKAVDP 
IK*WAKEiyiNKLH 


3207 


17108 


A 


3229 


174 


1 


VQMLEDKSFEETP*FSSETLiVLKIPHVQ 
PGAVAHTCNPSTLGGRGGRITRSGDRDH 
PG 


3208 


17109 


A 


3230 


2 


340 


KNHSVYLLCVFSIPFPTFCFF*F*VFNT 
F*FKLN* IN*FTYIDRVLLCHPGWSAVA 
* S *LTAAIjNSWAQAVPCLSIiLLAHHHAW 
L.I IFKKTAYIIHIWYVWYDTTYPFKVYN 
S 


3209 


17110 


A 


3231 


1 


313 


KKSTPYQRGFDP I S PARVPFS I KFFLEA 
ITFLLFDLEDALLLPLP*ALRFrFLFTV 
GGLTGWIiANSSLDIVLHDTYYVAPHFH 
YVLS IGAVLAI IGGFIH* FP 


3210 


17111 


A 


3232 


207 


355 


ELSPPRriKKTLy*IiGMVAHACNPSTLGA 
RGGQIA*AQEFKTSLCMyiAKP 


3211 


17112 


A 


3233 


3 


355 


TTQQ * LI KLTCKQTIAIHNTKGRT* ALI 
LISLIIFIATTNLLGLLPYSLTPTTQIiC 
INLAMAI PL * ADAE VIGPRSKI KNALAH 
LLPQGTPTPLIPILVIMETINLLIEPIA 
LARRL 


3212 


17113 


A 


3234 


2 


355 


KYLINNRLITNQQ*LIKLTSKQMITIHN 
TKGRT*SLILISLIIFIATTNHLGLLPY 
SFTPTTQLSINLAMDIPL*SGAMVIGFR 
S KI KTJALAHFLPQGTPTPL I P I LAI I ET 
ISLLI 


3213 


17114 


A 


3235 


43 


370 


QGCVKGWVLEEQVRRGWILDSSEGKTDL 
KQRGS PGSWPEHVGGWSGVMG* SEAWTG 
QARWLTP I ITALWEAEVGGSLRPGVQWH 
NLGSLQPLPPRFKRFSYFSLPSSWD 


3214 


17115


A 


3236 


15 


356 


LIQPSLKIi(yiISIHMTKGRT*SVILLSLI 
IFIATTNLLGLRPYLFTPTTQLSINLSM 
VIHIi*AGAMVIGFRSNIKNALAHLLPQG 
TPTPLMPILVIIETIRPLILPIALAVRL 
TA 


3215 


17116 


A 


3237 


1 


376 


GTRTNTLT I YQ * WGDGTRE S T YQGHHT P 
PVQKGLRYGI ILFITSEVFFFAGFF*AF 
YHSSLAPTPQri*GHWPPTGITPLNPLEV 
PLLNTCVLLASGVSIT*AHHSLIENNRD 
QIIQAIiLITIVLG 


3216 


17117 


A 


3238 


1 


358 


GTRNG* YTISIA*ALS ITVIGASLTSAQGS 
RIILLTLTGQPRFPTLTNINEHNPTLLN 
PIKRLAAGSRFAGFLITNNISPGCPFQT 
TIPLYLKITDLGVTFLGLLTGLDLNYLT 

NKLIIKA 


3217 


17118 


A 


3239 


258 


390 


RQGLLMLAGLVLNSWPLQSSHLGFPKCW 
DYGREPPCLGN*LIL 


3218 


17119 


A 


3240 


2 


372 


ARARFHHVSRDGLDLLT* *NTHLRLPKC 
WDYRREPLRPGKTFFLKKKKKKTS IFLFF 
REGFKEKSILGIKFFRPTGGVLILTGNH 
GWGCKTGTELLVPSRPPGLAFKI CGLWA 
HDTPHRVRNWL 


3219 


17120 


A 


3241 


3 


283 


HERLWGGWKTGAAGLGRTSSRPTASLTQ 
T*TTMTH*SRTTGCSTGSGRTWTRSRHS 
CAWGMAGLYRVAVASRGPRGMM* PTPPH 
WLCWAWKVP 


3220 


17121 


A 


3242 


170 


3 


I KSQAGLVGFLGP FS FQDSLNLVFVPGF 
FCF*DRVSPCSPGWSAWQSEFTAALV 


3221 


17122 


A 


3243 


2 


385 


ARADVDTPS YLTSDTIMRDI PTGGQVFS 
*LSTLHGSNMK* TAAPLLTLGFI FLFTA 
GGLTGLELTNSWLDIVLHDTYYVGPDFH 
YVLS IGAVFAI IGGLLH* YPLFSGYTLY 
RTYAQIHFAIIFIGEN 


3222 


17123 


A 


3244 


1 


106 


GTRYVGQAHLKCLTSSDSPASTSQSAGI 
TGVSHSA* PASTSQSAGITGVSHSA 


3223 


17124 


A 


3245 


2 


360 


ARANTLTIYQ*WSDVSLENTYQGHHTPP 
VQKGLRYRIILFITSEDYFFSGFL*AF* 
HSSIiS PTPQL *GHWPPTGITPL]NrPLKIVIP 
nucleotide
peptide
M
beginning nucleotide location corresponding to first amino acid residue of peptide sequence

Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence

Amino acid sequence (A=Alanine,
C=Cysteine, D=Aspartic Acid,
E=Glutamic Acid, F=Phenylalanine,
G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine,
N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop codon, /=possible
nucleotide deletion, \=possible
nucleotide insertion)














LLNTSVLIjASGVSIT*AHHSLIENNRNQ 
IIQALLI 


3224 


17125 


A 


3246 


17 


160 


GG*GCSCSEL*SCHCSPAWVTEQDFDSK 
KKPAILASCLKHLNPLSSH 


3225 


17126 


A 


3247 


236 


2 


WAHiyMTPLSPPSFLiKPVQ*KNFYIYST 
YSLDN*NPSSPKAK:RAPKKSYTrjPYI.HIi 
CVCVCECV+VCVCVCVCVCVWV 


3226 


17127 


A 


3248 


2 


2220 


FFGGGRPSPPQGYPLIiNNHSSPSPPVKIi 
NPGPA*FYPPTKGiCNFPPPQR*PSPPKN 

IKTPPPSFFFSS 


3227 


17128 


A 


3249 


2 


372 


AYTISFIGKISFYGLH**LPKAHVEAPI 
VGS IVLAAELLKLGGFGl IHLTLIFNPL 
TKHIAYPFLGLSL*GISITSSMWLRQTD 
LKSLIAYSSISHIAIiWTAILIQTP*SF 
TGAFFLIIDHG 


3228 


17129 


A 


3250 


293 


56 


EGSPKVIFNKSPPHHLFLFLFFFFIFFF 
*FFFFFFFFFFFFFFFFFFFFFFFFLYLi 
LAMF YLS FF FKQDNQRYRQYS X K 


3229 


17130 


A 


3251 


169 


370 


LKMTELRGAPASKPRGQEPPPHYPCHHH 
HHHHHHFD* VTKGQGPHHWPS PTRDPGW 


3230 


17131 


A 


3252 


22 


156 


GERIGLGLGGQGCSEP*LCHCTIiAWVGD 
TVRPCLKKKKKKKGPPF 


3231 


17132 


A 


3253 


3 


400 


QNQTPLLD^GGLITAVLLLLYLAVLTGG 
ITILIADRSLDSTLFYPAGGGDPILYQH 
IjF*FFGHPEVYILILPGFGIVSHISTNY 
WGGKEPFGPVGMV* AMIAMGFLGFIG * A 
HHIFTVGVDVHTRA*FTSAT 


3232 


17133 


A 


3254 


373 


31 


REVGPPTP*KI FFFPKGLNFWGGGGPKF 
PPPKKKGFFKKSPVGVFPPPGG*KSGPG 
PGFKKPPQKGKNISFPAGGKIGPPRGTL 
KRAPPFFFFFFFFFFFFLLWWVQVERP 
TL 


3233 


17134 


A 


3255 


1 


379 


LNLIQRQ*R*V*KFL*LPPQT*KKKKKK 
KKKKKGGGAFKKNLGGAKFNGGRKKKI P 
FLKGGVKKKKAGGGFKKRGKGKKCYLGI 
FEKKPFFGGGKNWBNPPKKIKGLREKKK 
F*GEKGEKKPEKAG 


3234 


17135 


A 


3256 


42 


376 


FCYISLVHHCI YNDLSFERKKNI FVPGQ 
INS I S S I A* EAHCKNKSLLHAVKKKKKK 














KEGRSSLKKEK 


3235 


17136 


A 


3257 


353 


67 


C YPLS PLKFFFS PRS LKFWKGVGP I IS P 
PKKKVPSQNSQEAGFPSPNVLKRRPGPN 
FKTTP * KEKKWPFPPPVKFGPPKESLKR 
PPLFFPFCGLQ 


3236 


17137 


A 


3258 


2 


129 


FHR I S QDGLDLLS S * S ARLGLP KCWDYR 
CEPPRPAKNKIIiLS 


3237 


17138 


A 


3259 


205 


415 


QQKNRRIiLHFKGARTHNSYNRGQPTTPS 
I TAHMYPRLRQSHT I YVLRVHHP * VPSA 
IEGPVSV*ALLHSSTIWAGNLLLIRFH 
PLiAEKS PLIQTVTLCLGANTTLVAGGWA 
LTQNDSKRIAAFSTSRGLGLIIVTIGVN 
QPHIAIiLHICTHAFVKAILFMCSGSriH 
NLSKEQDIRKIGGLLITIP 


3238 


17139 


A 


3260 


3 


393 


S FNLS Tli I TTQEHL * hhhPS * PIiAI I * F 
I STLAETNRTPFDLAEGEAELVSGFMIE 
YAAGPFALFLIAEYTNI I IINTLTTTIL 
LGTTYDALSPELYTTYFVTKTItLIiTSLL 
L* IRTAYPRFRYDQLIHL 


3239 


17140 


A 


3261 


2 


400 


I SDLSEK* FKRLWKLIMEAPEKGKAQC 
KEIQKMTQE VKGEI FKE* IA*RKKKSKF 
QBTLDTLXEMQSAIiESPSNRIKQVEERN 
SEDKDKIPELTQSNKDKGKRIRKYEQSL 
* VAWDYIQ*PNLGI IGIPEEE 


3240 


17141 


A 


3262 


450 


129 


NNIiAFN* V* EFCVliAI KLLKVKRYP IE W 
EKMFANHISDKGLVSGIYKEIiFRIiSNKQ 
AIDLTF*KWAAGHGGSPL 


3241 


17142 


A 


3263 


238 


3 


KEKKIGLKKCLQGSHFSIHTAWSiriYM 
FSPLTI ISRKRMGQPGIVAHTCNPST*G 
G*GRWITRSGVRDQTGQHGKTP 


3242 


17143 


A 


3264 


350 


3 


S PTLLGS KDPNLLGFRFPLWKKGKr IRA 
PLSLGLN*RFFSEWLIP*KPPKNWPGG 
TFLWCFLKRGFPFLSQKKKKKNPG^VA 
HACNPSTLGGRGGRSQGQELETSLANTV 
KTR 


3243 


17144 


A 


3265 


69 


200 


RIjECSGVISAHCNLNLPGSGDSPASAS * 
IiAGITVMVKLPVIAK 


3244 


17145 


A 


3266 


223 


408 


GGFPFFPPGGGEGGNFGELEPLPPGLRK 
FFCLGPPRRGD*GPRSSSPGSPWVFKK!Sr 

GVSP 


3245 


17146 


A 


3267 


185 


420 


DQGLWGFI I YFYRQSLA* VHWNNPSSL* 
PRTPGIiKHTPVPSLLISWDYGRTPPHLT 
NFCIFFDRGSFFF*DRVSATHA 


3246 


17147 


A 


3268 


3 


392 


TGCHS I PQAGVQWHNHGLLQPQPPGLR* 
SSCLSLPSSWHY 


3247 


17148 


A 


3269 


1 


398 


KFSCISSKHQKLKLTPKPPKPPPKKSPL 
VLPIGKKIRETFWGAFKSPPPNQPKGAQ 
TIjPLKI WDKMGGGGGLALW* KAPPGNF 
KGPPGGKPMEQP *IiGPGPPLKWKG6LPH 
QKGGFSKAPGEKKKGGEGRL 


3248 


17149 


A 


3270 


422 


183 


ETEWSLFKVI ITEKSPNLEKDTNIQVQ 

ESYRTPSRPILMKTTSRHLIIILPKVNN 

TERIL^MQQDRGNIMIQWSSGCSA 


3249 


17150 


A 


3271 


3 


35 


KNNSL 1 1 PTI I AT I TLLNLYF YLS PLLY 
**SSSPPS 


3250 


17151 


A 


3272 


155 


1 


iCDFFFFFLRQSFTLVAQAAAEWCDLCSL 
QPKPPGFK* FSYNSLSSSWDYIG 


3251 


17152 


A 


3273 


413 


71 


PPSTGLPIiTEEYEVSFPPFPL*KFFPPP 
SGLFFGGGVPPFPPPPKKGFPPKYPRLV 
FKGPLLGGGGLPPPPP*ILPPLGSFLPA 
PPLFFFFLFFL 


3252 


17153 


A 


3274 


336 


45 


DRVLSCSPAWRAiyiARSHDFG*LQLPPPR 
VKVFSCLSNPSSWDPRHVPPRKGNFVFL 
VKTGNPWNLGGQGCRERRLCPCIPAWGT 
DKDSVSKKKKKS 


3253 


17154 


A 


3275 


206 


366 


SVFFFVLFGFFYGALLCFPNPWLECSGT 
ILVHCNLHFPGSKDS SAS PS * VAGT 


3254 


17155 


A 


3276 


334 


75 


ENTRRVERRRRTHI I YYNFF*EFMNRFF 
FFIFFDGRPSCVPFFFPLFYFFFFFFFF 
FFFFFFFFPFFFFWFSARSFIYFLFPRH 

VT 


3255 


17156 


A 


3277 


144 


17 


KAPPLFFFFFFFFFFFPFFFFW*FRG*T 
HWNGDACMCNLTKS 


3256 


17157 


A 


3278 


2 


109 


YHIVKPSP*PLTGALSALLMTSGLAM*F 
HFHSITL 